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G-PROTEIN COUPLED RECEPTORS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences ol G-protein coupled receptors 
5 and to the use o! these sequences in the diagnosis, treatment, and prevention ot cell proliteralive, 

neurological, cardiovascular, gastrointestinal, autoimmunc/inllammatory, and metabolic disorders, and 
viral iniecLions, and in the assessment of the ettccts of exogenous compounds on the expression of 
nucleic acid and amino acid sequences of G-protein coupled receptors. 

H) BACKGROUND OF THE INVENTION 

Signal transduction is the general process by which cells respond to extracellular signals. 
Signal transduction across the plasma membrane begins with the binding of a signal molecule, e.g., a 
hormone, neurotransmitter, or growth factor, to a cell membrane receptor. The receptor, thus activated, 
triggers an intracellular biochemical cascade that ends with the activation of an intracellular target 

15 molecule, such as a transcription factor. This process of signal transduction regulates all lypcs of cell 
functions including cell proliferation, differentiation, and gene transcription. The G-protein coupled 
receptors (GPCRs), enccxled by one of the largest families of genes yet identified, play a central role in 
the transduction of extracellular signals across the plasma membrane. GPCRs have a proven history of 
being successful therapeutic targets. 

20 GPCRs arc integral membrane proteins characterized by the presence of seven hydrophobic 

transmembrane domains which together form a bundle of antiparallel alpha (a) helices. GPCRs range 
in size from under 400 to over 1000 amino acids (Strost>erg, A.D. (1991) Eur. J. Biochem. 196:1-10; 
Coughlin, S.R. (1994) Curr. Opin. Cell Biol. 6:191-197). The aminoterminus of a GPCR is 
extracellular, is of variable length, and is often glycosylated. The carboxy-terminus is cytoplasmic and 

25 generally phosphorylated. Extracellular loops alternate with intracellular loops and link the 

transmembrane domains. Cysteine disultide bridges linking the second and third extracellular kx:>ps 
may interact with agonists and antagonists. The m(^st conserved domains of GPCRs are the 
iraasmcmbrane domains and the first two cytoplasmic l(x)ps. The transmembrane domains account, in 
pan, for structural and functional features of the receptor. In most cases, the bundle of a helices torms 

30 a ligand-binding pocket. The extracellular N-terminal segment, or one or more of the three extracellular 
IcKps, may also participate in ligand binding. Ligand binding activates the receptor by inducing a 
conformational change in inu-acellular pt^rtions of the receptor. In turn, the large, third intracellular 
loop of the activated receptor interacts with a heterotrimeric guanine nucleotide binding (G) protein 
complex which mediates further intracellular signaling activities, including the activation of second 
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me^ssengcrs such as cyclic AMP (cAMP), phospholipase C, and mositol triphosphate, and the 
interacUon of the activated GPCR with ion channel proteins. (Sec, e.g., Watson, S. and S. Arkinstall 
(1994) The G-nrotein Linked Receptor Facts Book . Academic Press, San Diego CA, pp. 2-6; 
Bolandcr. F.F. (1994) Molecular Endocrinoloiiv , Academic Press. San Diego CA, pp. 162-176; 
5 Baldwin, J.M. (1994) Curr. Opin. Cell Biol. 6:180-190.) 

GPCRs include receptors for sensory signal mediators (e.g., light and olfactory stimulatory 
molecules); adenosine, y-aminobutyric acid (GABA), hepatocyte growth factor, melanocorlins, 
neuropeptide Y, opioid pepudes, opsins, somatostatin, tachykinins, vasoactive intestinal polypepUde 
family, and vasopressin; biogenic amines (e.g., dopamine, epinephrine and norepinephrine, histamine, 
10 glutamaie (metabotropic effect), acetylcholine (muscarinic effect), and serotomn); chemokines; lipid 
mediators of inllammation (e.g., prostaglandins and prostanoids, platelet aclivaUng factor, and 
icukotrienes); and pcpude hormones (e.g.. bombesin, bradykinin. calcitonin, C5a anaphylatoxin, 
cndotheUn, follicle-sUmulaUng hormone (FSH), gonadotropic-releasing hormone (GnRH), neurokinin, 
and thyrotropin-releasing hormone (TRH). and oxytocin). GPCRs which act as receptors for stimuli 
15 that have yet to be identified are known as orphan receptors. 

The diversity of the GPCR family is further increased by alternative splicing. Many GPCR 
genes contain introns. and there are currently over 30 such receptors for which splice variants have been 
identified. The largest number of variaUons are at the protein C-terminus. N-terminal and cytoplasmic 
loop variants are also frequent, while variants in the extracellular \oops or transmembrane domains are 
20 less common. Some receptors have more than one site at which variance can occur. The splicing 
variants appear to be functionally disUnct, based upon observed differences m distribution, signaUng. 
coupling, regulation, and ligand binding profiles (Kilpatrick, G.J. et al. (1999) Trends Pharmacol. Sci. 
20:294-301). 

GPCRs can be divided into three major subfamilies; the rhodopsin-like, secretin-like, and 
25 metabotropic glutamate receptor subfamilies. Members of these GPCR subfamilies share similar 
functions and the characteristic seven transmembrane structure, but have divergent amino acid 
sequences. The largest family consists of the rhodopsin-like GPCRs, which transmit diverse 
extracellular signals including hormones, neurotransmitters, and light. Rh^xJopsin is a photosens.Uve 
GPCR lound in animal retinas. In vertebrates, rhodopsin molecules are embedded in membranous 
30 stacks found in photoreceptor (rod) cells. Each rhodopsin molecule responds to a photon of light by 
triggering a decrease in cGMP levels which leads to the closure of plasma membrane s<xlium chamicls. 
In this manner, a visual signal is converted to a neural impulse. Other rhodopsin-like GPCRs are 
directly involved in responding to neurotransmitters. These GPCRs include the receptors for adrenaline 
(adrenergic receptors), acetylcholine (muscarinic receptors), adenosine, galanin. and glutamate (N- 
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meihyl-D-aspariaie/NMDA receptors). (Reviewed in Watson, S. and S. Arkinstall (1994) The G- 
Pr(uein Linked Receptor Facts Book , Academic Press, San DiecoCA, pp. 7-9, 19-22, 32-35. 130-131, 
214-216, 221-222; Habert-Onoli, E. et al. (1994) Proc. Natl. Acad. Sci. USA 9 1 :9780-9783.) 

The galanin receptors mediate the activity of the neuroendocTine peptide gaianin, which 
5 inhibits secretion ol insulin, acetylcholine, serotonin and noradrenaline, and stimulates prolactin and 
growth hormone release. Galanin receptors are involved in feeding disorders, pain, depression, and 
Alzheimer's disease (Kask, K. et al. (1997) Life Sci. 60:1523-1533). Other nervous system rhodopsin- 
like GPCRs include a growing family of receptors for lysophosphatidic acid and other 
lysophospholipids, which appear to have roles in development and neuropathology (Chun, J. ct al. 

10 (1999) Cell Biochem. Biophys. 30:213-242). 

The largest subfamily of GPCRs, the olfaaory receptors, are also memba-s of the rhodopsin- 
like GPCR fan\ily. These receptors function by transducing odorant signals. Numerous distinct 
olfactory receptors are required to distinguish different odors. Each olfactory sensory neuron expresses 
only one type of olfactory receptor, and distinct spatial zones of neuroas expressing distinct receptors 

15 are found in nasal passages. For example, the RAlc receptor wliich was isolated from a rat brain 
library, has been shown to be limited in expression to very distinct regions of the brain and a defined 
zone of the olfactory epithelium (Raming, K. et al. (1998) Receptors Channels 6:141-15 1). However, 
the expression of olfactory-like receptors is not confined to olfactory tissues. For example, tfiree rat 
genes encoding olfactory-like receptors having typical GPCR characteristics showed expression 

20 patteriLs not only in taste and olfactory tissue, but also in male reproductive tissue (Thomas, M.B. et al. 
(1996) Gene 178:1-5). 

Members of the secretin-like GPCR subfamily have as their ligands peptide hormones such as 
secretin, calcitonin, glucagon, growth hormone-releasing hormone, parathyroid hormone, and 
vasoactive intestinal peptide. For example, the secretin receptor responds to secretin, a peptide 
25 hormone that stimulates the secretion of enzymes and ions in the pancreas and small intestine (Watson, 
supra , pp. 278-283). Secretin recq)tors are about 450 amino acids in length and are found in the 
plasma membrane of gastrointestinal cells. Binding of secretin to its receptor stimulates the production 
of c AMP. 

Examples of sccretin-like Gf*CRs implicated in inflammation and the immune response 
30 include the EGF module-containing, mucin-like hormone receptor (Emrl) and CD97 receptor 

proteins. TTiese GPCRs are members of the recently characterized EGF-TM7 receptors subfamily. 
These seven transmembrane hormone receptors exist as heterodimers in vivo and contain between 
three and seven potential calcium-binding EGF-like motifs. CD97 is predominantly expressed in 
leukocytes and is markedly upregulated on activated B and T cells (McKnight, A.J. and S. Gordon 
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(1998) J. Leukoc. Biol. 63:271 -280). 

The third GPCR subfan^ly is the nietabotropic glulamaie receptor family. Glulamate is the 
major exc.tatory neurotransnuuer ,n the central nervous .systenv The metabotrop.c glulamate receptors 
modulate the acuvUy ol mtracellular elfectors, and are involved m long-term potenliaUon (Watson, 
5 suEia, p. 1 30). The Ca^-sensmg receptor, which senses changes m the extracellular concentration ol 
calcum tons, has a large extracellular domain including clusters ol ac.dtc anuno acids which n.ay be 
involved in calcium binding. Tl.e metabotroptc glutamate receptor family also .ncludes pheromone 
receptors, the GABAb receptors, and the taste receptors. 

Other subfamilies of GPCRs include two groups of chemoreceptor genes found in the 
10 nematcxles m.norh.bditis ele.ans and raenorhabdius brl,^sae. which are distantly related to the 

mammalian olfactory receptor genes. The yeast pheromone receptors STE2 and STE3, involved in the 
response to mating factors on the cell membrane, have their own seven-traasmembrane signature, as do 
the CAMP receptors from the slime mold DicMelium^^ which are thought to regulate the 
aggregauon of individual cells and control the expression of numerous developmentally-regulated genes. 
,5 GPCR mutauoas. which may cause loss of function or constituuve activation, have been 

associated wiU. numerous human diseases (Coughlia suera). For instance, retinitis pigmentosa may 
arise from mutations in the rhodopsin gene. Furthermore, somatic activating mutatioas in the 
thyrotropin receptor have been reported to cause hyperfunctioning thyroid adenomas, suggesung that 
certain GPCRs susceptible to consututive activaUon may behave as protooncogenes (Parma. J. et al. 
.0 (1993) Nature 365:649-651). GPCR receptors for the following ligands also contain mutations 

asscKiated with human disease: luteinizing hormone (preccK.ous puberty); vasopressin V, (X-linked 
nephrogenic diabetes); glucagon (diabetes and hypertension); calcium (hyperparathyroidism, 
hypocalcuria. hypercalcemia); parathyroid hormone (short limbed dwarfism); ^-adrenoceptor (obesity, 
non-insulin-dependent diabetes mellitus); growth hormone releasing hormone (dwarfism); and 
adrenocorticotropin (glucocorticoid deficiency) (Wilsoa S. et al. (1998) Br. J. Pharmocol. 125:1387- 
1 392 Stadel. J.M. et al. (1997) Trends Pharmacol. Sci. 18:430-437). GPCRs are also involved m 
depre-ssion, schizophrenia, sleeplessness, hypertens.oa anxiety, stress, renal failure, and several 
cardiovascular disorders (Horn. F. and G. Vriend (1998) J. Mol. Med. 76:464-468). 

,n addition, within the past 20 years several hundred new drugs have been recognized that are 
,0 directed towards activating or inhibiUng GPCRs. The therapeutic targets of these drugs span a wide 
range of diseases and disorders, including cardiovascular, gastrointestinal, and centra, nervous system 
disorders as well as cancer, osteoporosis and endometriosis (W,ls<.n. sunra; Stadel. supra). For 
example the dopan.ne agonist L-dopa ,s used to treat Parkinson's disease, while a dopanune antagonist 
is used to treat schizophrenia and the early stages of Huntington's disease. Agomsts and antagonists ol 
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adremx;epiors have been used tor ihe ireaimeni of asthma, high blo^xj pressure, other cardiovascular 
disorders, and anxiety: muscarinic agorusts arc used in the treatment of glaucoma and tachycardia: 
serotonin 5HT1D antagonists are used agaiasi migraine: and histamine HI antagonists are used against 
allergic and anaphylactic reactions, hay fever, itching, and motion sickness (Horn, surra ). 
5 Recent research suggests pK)tential future therapeutic uses for GPCRs in the treatment of 

metabolic disorders including diabetes, obesity, and osteopc^trosis. For example, mutant V2 vasopressin 
receptors causing nephrogenic diabetes could be functionally rescued in vitro by co-expression of a C- 
terminal V2 receptor f>eptide spanning the region containing the mutations. This result suggests a 
possible novel strategy for disease treatment (Schoneberg, T. et al. (1996) EMBO J. 15:1283-1291). 

10 Mutations in melanocortjn-4 receptor (MC4R) are implicated in human weight regulation and obesity. 
As with the vasopressin V2 receptor mutants, these MC4R mutants are defective in trafficking to the 
plasma membrane (Ho, G. and R.G. MacKenzie (1999) J. Biol. Chem. 274:35816-35822), and thus 
nnight be treated with a similar strategy. The type 1 receptor for parathyroid hormone (PTH) is a 
GPCR that mediates the PTH-dependeni regulation of calcium homeostasis in the bloodstream. Study 

1 5 of PTH/receptor interactions may enable the development of novel PTH receptor ligands for the 
treatment of osteoporosis (Mannstadt, M. et al. (1999) Am. J. Physiol. 277:F665-F675). 

The chemokine receptor group of GPCRs have ix)tenlial therapeutic utility in inflammation and 
infectious disease. (For review, seeLocati, M. and P.M. Murphy (1999) Annu. Rev. Med, 50:425- 
440.) Chemokines are small polypeptides that act as intracellular signals in the regulation of leukocyte 

20 trafficking, hematopoiesis, and angiogenesis. Targeted disruption of various chemokine receptors in 
mice indicates that these receptors play roles in pathologic inflammation and in autoimmune disorders 
such as multiple sclerosis. Chemokine receptors are also exploited by infectious agents, including 
herpesviruses and the human immumxleficiency virus (HIV-1) to facilitate infection. A truncated 
version of chemokine receptor CCR5, which acts as a coreceptor for infection of T-cells by HlV-1 , 

25 results in resistance to AIDS, suggesting that CCR5 antagonists could be useful in preventing the 
development of AIDS. 

The discovery of new G-protein coupled receptors and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 
and treatment of cell proliferative, neurological, cardiovascular, gastrointestinal, 

30 autoimmune/inflammatory, and metabolic disorders, and viral infections, and in the assessment of the 
effects of exogenous compounds on the expression of nucleic acid and amino acid sequences of G- 
protein coupled receptors. 

SUMMARY OF THE INVENTION 
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The invenuon Icatures puntled polypepudes, G-proie.n coupled receptors, relerred to 
collecuvely as ' GCREC" and individually as "GCREC-l . " -GCREC-2/- ■'GCREC-3, ' ••GCREC-4.- 
••GCREC-5.- ••GCREC-6;' ' GCREC-T, - -GCREC-S, ' •GCREC-9/- "GCREC-IO, ' -GCREC-l 1. ' 
■ GCREC-12,- -'GCREC-13.''- GCREC-14; -'GCREC-15; 'GCREC-16, -''GCREC-17; - GCREC- 

18. - ••GCREC-19." '•GCREC-20." and '■GCREC-21 In one aspect, the mveniion provides an isolated 
poiypcpiide comprism. an amino ac.d sequence selected from the group consisUng ol a) an amino acid 
sequence selected from the group cons.sUng of SEQ ID NO: I -21 . b) a naturally cxcurnng amino acd 
sequence having at least 90% sequence identity to an amino aad sequence selected from the group 
consisting of SEQ ID NO.1-21, c) a biologically acUve fragment of an amano acid sequence selected 
irom the group consisung of SEQ ID NO:l -21 . and d) an immunogenic fragment of an ammo acid 
sequence selected from the group consisting of SEQ ID NO: 1-21. In one alternative, the .nvenuon 
provides an isolated polypeptide comprising the ammo acid sequence of SEQ ID NO: 1 -2 1 . 

The mvention further provides an isolated polynucleotide encoding a polypeptide comprising an 
ammo acid sequence selected from the group consisung of a) an an^no acid sequence selected from the 
eroup consisting of SEQ ID NO:l-21, b) a naturally occurring amino acid sequence having at least 
907. sequence idenuty to an amino acid sequence selected from the group consisUng of SEQ ID NO: 1 - 
2 1 . c) a bioloaically active fragment of an amino acid sequence selected from the group consisung ol 
SEQ ID NO:l-21, and d) an inmiunogemc fragment of an amino acid sequence selected from the group 
consisung of SEQ ID NO:l-21. In one alternative, the polynuclcoUde encodes a polypeptide selected 
Irom the group consisting of SEQ ID NO:l -21 . In another alternaUve, the polynucleoude is selected 
from the croup consisting of SEQ ID NO:22-42. 

Additionally, the invenlion provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleoude encoding a polypepUde comprising an ammo acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
.s consisting of SEQ ID NO: 1 -21 . b) a naturally occurring amino acid sequence having at least 90% 

sequence idenuty to an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -2 1 , c) 
a biologically acUve fragment of an amino acid sequence selected from the group consisting ot SEQ ID 
NO 1 -2 1 and d) an immunogenic fragment of an anuno acid sequence selected from the group 
consisung of SEQ ID NO: 1-21. In one alternaUve. the invention provides a cell transformed with the 
30 recombinant polynucleotide. In another alternaUve, the invenuon provides a traasgenic organism 
comprising the recombinant polynucleoUde. 

The invention also provides a methcxl for prcxlucing a polypeptide compnsmg an amino acid 
sequence selected from the gr<.up consisung of a) an ammo acid sequence selected from the group 
consisung of SEQ ID NO:l-21. b) a naturally occurring amino acid sequence having at least 90% 
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sequence identity to an amino acid sequence selected from the group consisting ot SEQ ID iNO: 1-21, c) 
a biologically active fragment of an amino acid sequence selected Irom the group consisung of SEQ ID 
NO: 1-21, and d) an inuiiunogemc fragment of an amino acid sequence selected imm the group 
consisting of SEQ ID NO:l-21. The melh<xJ comprises a) culturing a cell under conditions suitable for 
5 expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleoude 
comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) 
recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 

10 sequci.ee selected from the group consisting of SEQ ID NO: 1-21, b) a naturally occurring amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 -2 1 , c) a biologically active fragment of an anuno acid sequence selected 
from the group consisting of SEQ ID NO: 1-21, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-21. 

15 The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 

selected from the group consisting of a> ^ polynucleotide sequence selected from the group consisting of 
SEQ ID NO:22-42. b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:22-42, c) a 
polynuclecHide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 

20 an RN A equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group coasisting of 

25 SEQ ID NO:22-42, b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:22-42, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RN A equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising 
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleoude 

30 in the sample, and which probe spccitlcally hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target polynucleotide or 
fragments thereof, and b) detecting the presence or absence of said hybridization complex, and 
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 
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The mvenUon ft^rther provides a method for detecting a target polyaucleoude m a sample, said 
tar.et polynucleotide having a sequence o. a polynucleoude comprising a polynucleotide sequence 
serected from the group consisung o. a) a polynucleoude sequence selected Irom the group cons.sung o. 
SEQ ID NO.22-42. b) a naturally occurrmg polynucleoude sequence havmg at least 90% sequence 
5 .denuty to a polynucleotide sequence selected from the group cons.sung of SEQ ID NO:22-42, c) a 

p<,lvnucleoude sequence complementary to a), d) a polynucleoude sequence complementary to b), and e, 
an RN A equivalent oi a)-d). The method compr.ses a) amplifying said target polynucleoude or 
fragment thereof using polymerase chain reacuon amplificaUon, and b) detecting the presence or 
absence of said amplified target polynucleotide or fragment thereof, and. optionally, if present, the 
10 amount thercot. 

The mvenuon further provides a composiuon comprising an effecuve amount of a polypepude 
comprusmg an anuno ac.d sequence selected from the group coasistmg of a) an an.no acid sequence 
selected from the group coas.sting of SEQ ID NO:l-21 , b» a naturally occurring amino acid sequence 
having at least 90% sequence idenuty to an amino acid sequence selected from the group cons.sung of 
,5 SEQ ID NO;l-21. c) a biologically active fragment of an amino acid sequence selected from the group 
coasisting of SEQ ID NO:l-21 . and d) an .nmunogen.c fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1 -21 . and a pharmaceutically acceptable excip.ent In one 
embodiment, the composition comprises an an^no acid sequence selected from the group cons.sung of 
SEQ ID NO l-21 The invention addiuonally provides a method of treaUng a disease or cond.uon 
20 associated with decreased expression of lx.nctional GCREC. comprising adtninistering to a pauent m 
need ol" such treatment the composiuon. 

The invenuon also provides a method for screening a compound for effectiveness as an 
agonist Of a p<.lypeptide comprising an amino acd sequence selected from the group cons.sung oi a) an 
an.no acid sequence selected from the group consisUng of SEQ ID NO:l-21. b) a naturally occurring 
amin<i acid sequence having at least 90% sequence identity to an amino acid sequence selected Irom the 
group consisung of SEQ ID NO. 1 -2 1 , c) a biologically active fragment of an ammo ac.d sequence 
selected from the group consisung of SEQ ID NO:l -21 , a,id d) an .mmunogciic fragment of an a.nmo 
acid sequence selected from the group coasisung of SEQ ID NO:l -21 T^e method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting agon.st act.v.ty m the 
.0 sample In one alternative, the invenuon provides a composition comprising an agon.st compound 
identified by the methcxl and a pharmaceutically acceptable excip.ent. In another alternauve, the 
invention provides a method of treaUng a disease or cond.tion associated witli decreased expression ol 
functional GCREC. compr.s.ng adm.ni.ster.ng to a patient .n ,ieed of such ueatment the c<,mpos.uon. 
Addiuonally, the invention provides a method for screening a compound for effectiveness as 
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an antagonist ot a polypeptide comprising an amino acid sequence selected from the group consisting 
ot a) an anruno acid sequence selected Irom the group consisting ol SEQ ID NO: 1-21. b) a naturally 
occurring amino acid sequence having at least 90% sequence idenuiy to an amino acid sequence 
selected from the group consisung of SEQ ID NO: 1-21, c) a biologically active fragment of an anuno 

5 acid sequence selected from the group consisting of SEQ ID NO:l-2K and d) an immunogenic fragment 
of an amino acid sequence selected from the group consisung of SEQ ID NO: 1-21. Hie methcxl 
comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a composition comprising 
an antagonist compx)und identified by the metliod and a pharmaceutically acceptable excipient. In 

10 anoihei alternative, the invention provides a method of treating a disease or condition associated with 
ov erexpression of functional GCREC, comprising administering to a patient in need of such treatment 
ihe composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 

15 acid sequence selected from ihe group consisting of SEQ ID NO:l-21, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-21, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-21, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-21. The methcxl comprises a) combining 

20 the polypeptide with at least one test compound under suitable conditions, and b) detecting binding 
oi the polypeptide to the test compound, thereby identifying a compx)und that specifically binds to the 
polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 

25 amino acid sequence selected from the group consisting of SEQ ID NO: 1 -21 , b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1 -2 1 , c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-21, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-21. The method comprises a) 

30 combimng the polypeptide wiUi at least one test compound under conditions permissive for the 
acuvity of the polypeptide, b) assessing the activity of the polyf>epiide in the presence of the test 
compound, and c) comparing the activity of the polypeptide in the presence of the test compound with 
the activity of the polypeptide in the absence of the lest compound, wherein a change in the acuvity ol 
the polypeptide in the presence of the test compound is indicative of a comp>ound that m(xlulates the 
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acuvity of the polypeptide. 

Tnc .nvenuon further prov.dcs a method tor screening a compound lor effectiveness m 
a,ter.ne expression of a target po.ynucleoude, wherem sa.d target p<,. ynuc.eot.de comprises a 
sequence selected from the group cons.sung of SEQ ID NO:22-42, the metf^od co.npr.stng a) 
5 expos-ng a sample comprts.ng t.e target polynuc.eot.de to a compound, and b) detecung altered 
expression of Ihe target poLynudeolide. 

The mveation h^rther provides a tnemod lor a.sessl„g to^ctty of a test compound, said 
ntet^txl eon,pnsi„g a, treattng a „,o,ugiea, sanrpfe containing nuCeic aeids wit. the lost contpound, 
0, hybridizing the nue.ele adds of the treated hlofoglcal satnp.e with a probe conrprlsing at leas, 
,„ eonU^uofS nucleotides of a polyntideoUd. co.pns,hg a po.ynucleoddc sequence selecved fron, the 
eroup consisung o, ■) a polynticleoUde sequence selected from the group consisting o, SEQ ID 

22-42 ii, a naturally occurring poly„uc,e.>nde sequence having at least 90* sequence ideiiuty to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:22-42, ill) a 
polynucieoude sequence complementary to i). iv) a polynudeotide sequence contplementary to i,). 
,5 and V) an RNA equivalent of i)-iv). Hybridization occurs under condiUons whereby a spechc 
hybridization complex is formed between said probe and a target polynuCeodde ,n the biological 
sample, s^d targe, polynuCeodde composing a polynuCeoUde sequence -'^-^^'" '^TJ^^ 
consisung of i) a polynucfeoude sequence selected from the group consisting of SEQ ID NO 22-4., 
ii) a naturally occurring polynudeotide sequence having at le»>t 90% sequence idenhty to a 
polynucleoUde sequence selected fr.>m *e group consisting of SEQ ID NO:22-42, Hi) a 
polynucleotide sequence compiememary to , ,, iv) a polynucleotide sequence complementary to it), 
and V, an RNA equivalent of ,)-iv), AhernaUv.ly, the target polynucleotide comprises a fragmem ot 
a polynudeodde sequence selected from the group consisting of i)-v) above, ci quandfyng the 
amount o, hybridization complex; and d) comparing the amount of hybndizadon complex in the 
,5 treated biological sample wiU, the amount of hybridization complex in an „nm=a,ed biologtca, 

sample, wherein a difference in tbe amount of hybridization complex in the treated biological sample 
IS indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenc.aturc for the full length polynuc.e..,de and po.ypepude 

sequences ot the present invention. 

Table 2 shows the GenBank .dentillcauon number and annotation of the nearest GenBank 
homo.og for polypepUdes of the invention. The probability score for the match between each 
polypeptide and its GenBank homo.og is also shown. 

Tab.e 3 shows structural features of polypeptide sequences of tf.e invention, including predicted 
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moufs and domains, along with ihe methods, algorithJiis, and searchable databases used for analysis ot 
the fx)typeptides. 

Table 4 lists the cDN A and genomic DNA tragments which were used to assemble 
polynucleotide sequences of the invention, along with selected tragments ot the polynucleotide 
5 sequences. 

Table 5 shows the representative cDNA library tor polynucleotides ot the invention. 
Table 6 provides an appendix which describes ttie tissues and vectors used for construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
10 polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

DESCRlFflON OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 

that this invention is not limited to the particular machines, materials and methods described, as these 
15 may vary. It is also to be understood that the terminology used herein is for the purpose of describing 

particular embodiments only, and is not intended to limit the scope of the present invention which will 

be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms ''a,*' ''an," 

and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
20 reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 

reference lo one or more antibodies and equivalents thereof known to those skilled in the art, and so 

forth. 

Unless detined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understcxxl by one of ordinary skill in the art to which this invention belongs. Although 

25 any machines, materials, and methods similar or equivalent to those described herein can be used to 

practice or test the present invention, the preferred machines, materials and mcth(xis are now described. 
All publicauoas mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
prouxols, reagents and vectors which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 

30 not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"GCREC" refers to the amino acid sequences of substantially purified GCREC obtained ln)m 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

11 
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The term -agonist" relers lo a molecule which iniensitles or niiniics the biological acuv.ty ol 
GCREC. Agonists may mclude proteins, nucleic acids, carlx^hydrates, small molecules, or any other 
compound or composiuon which modulates the acuvity ot GCREC either by directly interacting with 
GCREC or by aclinc on components ol the biological pathway in wt^ch GCREC participates. 

5 An "allelic variant" .s an alternative form ot the gene encoding GCREC. Allelic variants may 

result from at least <.ne mulaUon m the nucleic acid sequence and may re^sult in altered niRNAs or in 
polypeptides whose structure or funcuon may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally cKCurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deleuons, additions, or substitutions of nucle.>udes. 

10 Each of these types of changes may occur alone, or in combination with the others, one or more times in 
a given sequence. 

• Altered" nucleic acid sequences encoding GCREC include those sequences with deletions, 
inseruons, or subsUtuUons of different nucleotides, resulting in a polypeptide the same as GCREC or a 
polypeptide with at least one tlincuonal characteristic of GCREC. Included within this delinition are 
,5 polymorphisms which may or may not be readily detectable using a particular oUgonucleoude probe of 
the polynucleotide encoding GCREC. and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding GCREC. 
The encoded protein may also be "altered," and may contain deletioas. insertions, or substituUons of 
amino acid residues which produce a silent change and result in a functionally equivalent GCREC. 
20 Deliberate amino acid subsututions may be made on the basis of similarity m polarity, charge, 

solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the 
biological or immunological acUvity of GCREC is retained. For example, negauvely charged amino 
acids may include asparUc acid and glutamic acid, and positively charged amino acids may include 
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity 
25 values may include: asparagine and glutamine; and serine and threonine. Amino acids with 

uncharged side chains having similar hydrophilicity values may include: leucine, isoleuc.ne, and 
valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms - amino acid" and ' amino acid scciucnce" refer to an oligopeptide, pepude, 
polvpcpude, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
30 molecules. Where - amino acid sequence" is recited to refer to a sequence of a naturally (x:curring 

protein molecule, -amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

-Amplification" relates to the production of additional copies of a nucleic acid sequence 
Amplificauon is generally carried ou, using polymerase chain reaction (PCR) technologies well known 
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in the art. 

The term "antagonist * relers to a molecule which inhibits or attenuates the biological activity ol 
GCREC. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of GCREC either by 
5 directly mteracting with GCREC or by acting on components ot the biological pathway in which 
GCREC participates. 

The term "antibody'' relers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, FCab')., and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind GCREC polypepudes can be prepared using intact polypeptides or using 

10 fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 

oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translaUon of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 

15 the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of aniib<.xiies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 

20 protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune resf)onse) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(c(xling) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA: 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 

25 phosphorolhioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2'-meihoxyethyl sugars or 2'-mcthoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5 -methyl cyiosine, 2'-dcoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 

30 nucleic acid sequence prcxluced by the cell to form duplexes which block cither transcription or 
u-anslation. The designation "negative" or "minus" can refer to the antisense strand, and ihc 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term ^'biologically active" refers to a protein having structural, regulatory, or biochemical 
funcuons of a naturally occurring molecule. Likewise, "immunologically active" or ^^immunogenic" 
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relcrs to Ihe capability of the natural, recombinant, or synthetic GCREC. or of any oligopepude thereof, 
to induce a specific inimune response in appropriate animals or cells and to bind with specific 
antibodies. 

• Complementary- describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5 -AGT-3' pairs with its complement. 
3'-TCA-5'. 

A "composiuon comprising a given polynucleotide sequence" and a •■composition comprising a 
given amino acid sequence" refer broadly to any composiuon containing the given polynucleoUde or 
amino acid sequence. The composiuon may comprise a dry fc^rmulaUon or an aqueous soluUon. 
ComposiUons comprising polynucleotide sequences encoding GCREC or fragments of GCREC may be 
employed as hybridizauon probes. The probes may be stored m freezc-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl). detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

• Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied Biosyslems, 
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GELVIEW fragment assembly system (GCC, Madison WT) or Phrap 
(Umversity of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

•'Coaservative amino acid substitutions" are those subsUtutioas that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
coaservative amino acid substitutions. 

Original Residue Conservative Substitution 

Ala Gly, Scr 

Arc 

Asn ^^^P' ^''^ 

Asp AS"- 

(2ys Ala, Ser 

Gin Asn, Glu, His 

Q1^J Asp, Gin, His 

Gly Ala 

His Asn, Arg, Gin, Glu 

[le Leu, Val 
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Leu He, Val 

Lys Arc, Gin, Glu 

Mel Lou, He 

Phc His, Mel, Leu, Trp. T>t 

Ser Cys, Thr 

Thr Scr, Val 

Trp Phe, Tyr 

Tyr His, Phe, Trp 

Val He, Leu, IThr 



Conservative anuno acid substitutions generally mainiain (a) the structure ot the polypepude 
backbone in the area ol ihe substitution, tor example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobiciiy ol ihe molecule at the site of the substitution, and/or (c) the bulk of ihe 
side chain. 

15 A ^'deletion" refers to a change m the amino acid or nucleotide sequence that results in ihe 

absence of one or more amino acid residues or nucleotides. 

The term ''derivative" refers to a chemically mtxlified fK)lynucleotide or polypeptide. Chenucal 
modifications of a polynucleotide can include, for example, replacement of hydrogen by an alkyl, acyU 
hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains ai least one 

20 biological or immunological function of the natural molecule. A derivative polypeptide is one modified 
by glycosylation, pegylation, or any similar process that retains at least one biological or immunological 
function of the polypeptide from which it was derived. 

A ''detectable labeV refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalenily joined to a polynucleotide or polypeptide. 

25 A "fragment" is a unique portion of GCREC or the polynucleotide encoding GCREC which 

is identical in sequence to but shorter in length than the parent sequence. A fragment may comprise 
up to the entire length of the defined sequence, minus one nucleotide/aniino acid residue. For 
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A 
fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at leasi 

30 5, 10, 15, 16,20,25,30,40,50,60,75, 100, 150, 250 or at least 500 contiguous nucleotides or 
amino acid residues in length. Fragments may be preferentially selected from certain regions of a 
molecule. For example, a polypeptide fragment may comprise a certain length of contiguous amino 
acids selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypepude as shown 
in a certain defined sequence. Clearly these lengths are exemplary, and any length that is supported 

35 by the specification, including the Sequence Lisung, tables, and figures, may be encompassed by the 
present emb<xliments. 

A fragment of SEQ ID NO:22-42 comprises a region of unique polynucleotide sequence that 
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spccilically identifies SEQ ID NO:22-42, for example, as distinct Irom any other sequence m the 
genome frotn which the fragment was obtained. A fragment of SEQ ID NO:22-42 is useful, for 
example, m hybnd.zauon and ampliflcauon technologies and in analogous methods that distinguish 
SEQ ID NO:22-42 from related polynucleotide sequences. The precise length of a fragment of SEQ 

5 ID .NO:22-42 and the region of SEQ ID NO:22-42 to which the fragment corresponds arc routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO:l-21 is encoded by a Iragmem of SEQ ID NO.22-42. A fragment 
of SEQ ID NO; 1-21 comprises a region of unique ani.no acid sequence that specifically idcnufies 
SEQ ID NO:l-21, For example, a fragmcm of SEQ ID NO:l-21 is useful as an immunogenic peptide 

10 for the development of anUbodies that specifically recognize SEQ ID NO:l-21. "Hie precise length of 
a fragment of SEQ ID NO:l-21 and the region of SEQ ID NO:l-21 to which the fragment 
corresponds are routinely detcrmmaWe by one of ordinary skill m the art based on the intended 

purpose for the fragment. 

A - lull length ' polynucleotide sequence is one containing at least a translation iniUaUon codon 
,5 (e.g., methionine) followed by an open reading frame and a translaUon termination codon. A ' full 
length" polynucleoude sequence encodes a - full length" polypeptide sequence. 

"Homology" refers to sequence similarity or. interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% idenUty," as applied to polynucleoUde sequences, refer to 
20 the percentage of residue matches between at least two polynucleoUde sequences aligned using a 

standardized algorithm. Such an algorithm may insert, m a standardized and reproducible way, gaps in 
the sequences being compared in order to opUmize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent idenuty between polynucleoUde sequences may be determined using the default 
25 parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3 . 1 2e sequence 
aliuranent program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis proerams (DNASTAR. Madison WI). CLUSTAL V is described in Higg.ns, D C. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higg.ns. D.G. et al. (1992) CABIOS 8.189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows. 
^0 Ktuple=2, eap penaliy^S, window=4, and "diagonals saved"=4. The "weighted" residue weight table .s 
selected as the default. Percent idenuty is reported by CLUSTAL V as the "percent sinular.ty " between 
aligned polynucleotide sequences. 

AltcrnaUvely, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology InformaUon (NCBI) Basic Local Alignment Search 
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Tool (BLAST) (-Alischul, S.F. ci al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
sources, including the NCBl, Bethesda, MD, and on the Internet at 

htlpV/www.ncbi.nlm.nih.eov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a kjiown polynucleotide sequence with other 
5 polynuclecuide sequences Irom a variety of databases. Also available is a tool called * BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
10 compare two nucleiHide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 
15 Open Gap: 5 and Extension Gap: 2 penalties 

Gap X drop-off: 50 

Expect: JO 

Word Size: 11 

Filter: on 

20 Percent identity may be measured over the length of an entire defined sequence, for example, as 

defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 
the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 2(X) contiguous nucleotides. Such 
lengtfLs are exemplary only, and it is understood that any fragment length supported by the sequences 

25 shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic ccxle. It is understood that changes in 
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 

30 that all encode substantially the same protein. 

The phrases "percent identity" and identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
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explained in more detail above, generally preserve the charge and hydrophob.ciiy al the sue of 
substituuon, thus preserving the structure (and therefore ftincuon) ol the polypeptide. 

Percent idenUty between p<Mypept.de sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 1 2e sequence alignment 

5 program (described and referenced above). For pairwise alignments of polypeptide sequences usmg 
CLUSTAL V, the default parameters are set as follows: Kiuple= 1 . gap pcnalty=3, window=5. and 
■ diagonals saved"=5. The PAM250 maUix is selected as the default residue weight table. As w.th 
polynucleoUde alignments, the percent idenUty is reported by CLUSTAL V as the "percent smulanty" 
between aligned polypeptide sequence pairs. 

1 0 Alternau vely the NCBI BLAST software suite may be used. For example, for a pairwise 

coruparKson ot two polypepude sequences, one may use the ^'BL AST 2 Sequences'^ tool Version 2.0. 1 2 
( Apr.] -2 1-2000) with blastp set at delault parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

Open Gap: II and Extension Gap: I penalties 
1 5 Gap X drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an enure defined polypeptide sequence, for 
20 example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for mstance, 
a fraement of at least 15. at least 20, al least 30, at lea.st 40. at least 50. at least 70 or at least 150 
contieuous residues. Such lengths are exemplary only, and it is understcxxl that any fragment length 
supported by the sequences shown herein, m the tables, figures or Sequence LisUng, may be used to 
25 describe a length over which percentage identity may be measured. 

• Human artificial chromosomes" (HACs) are linear micrcxhromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term -humanized antibody" refers to an antibody molecule in which the anuno acid 
30 sequence in the non-antigen binding regions has been altered so that the anUbody more closely 
resembles a human antibody, and still retains its original binding ability. 

•■Hybridization" refers to the process by which a polynucleoUde strand anneals w.lh a 
complementary strand through base pa.nng under defined hybridization conditions. Specific 
hybridizauon is an indication that two nucleic acid sequences share a high degree of complementarity . 
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Specific hybridization complexes form under permissive annealine conditions and remain hybridized 
atier ihe "washing" sicp(s). The washing step(s) is parucularly imporiani in deiernuning the stringency 
ol the hybridization prixess, wiih more stringent conditions allowing less non-specific binding, i.e.. 
binding between pairs ol nucleic acid strands that are not perfectly matched. Permissive conditions for 
5 annealing of nucleic acid sequences are routinely determinable by one ol ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 6H''C in the presence of about 6 x SSC, about \ Vv (w/v) 
SDS, and about 1(X3 ^g/ml sheared, denatured salmon sperm DNA. 

10 Generally, stringency of hybridization is expressed, in pan, with reference to the temperature 

under which the wash step is carried out. Such wash temperatures are typically selected to be about 
S^'C to 21TC lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T^ is the temperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. An equation for calculating T^^^ and conditions 

15 for nucleic acid hybridization arc well known and can be found in Sambrook, J. et al. (1989) Molecular 
Cloning: A Laboratory Manual , 2''^ ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68X in the presence of about 0.2 x SSC and about (). 1 % SDS, for 1 hour. 

20 Alternatively, temperatures of about 6()°C, 55°C, or 42"C may be used. SSC concentration may 

be varied from about 0, 1 to 2 x SSC, with SDS being present at about 0. 1 %. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 1(X)-2(X) pg/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 

25 such as for RNA:DNA hybridizatioas. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 

30 sequences by virtue of the formation oi hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., CqI or Rot analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
or their nucleic acids have been fixed). 
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The words '-.aserUon" and -addition ' refer to changes m an amino ac.d or nucleoUde sequence 
resulun. m the addition ot one or more ammo acd residues or nucleoudes, respectively. 

' immune response" can reler to conditions associated with intlammation, trauma, muiiune 
disorders, or intecuous or acneUc disease, etc. These condiuons can be characterized by expression o. 
5 various factors, e.g.. cytokines, chemokines, and other signaling molecules, which may affect cellular 

and sysieinic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopepude fragment of GCREC which is 
capable of eliciting an intmune respoase when .ntr<xluced into a living organism, for example, a 
mantntal The term "immunogenic fragment" also includes any polypepude or oligopepude fragment of 
0 GCREC which ,s useful in any of tfie antibody production methcxls disclosed herein or known in the art. 

The term -nucroarray" refers to an arrangement of a plurality of polynucleoUdes, polypeptides, 

or other chemical compounds on a substrate. 

The terms ' element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined posiuon on a microarray. 
15 The term -modulate" refers to a change in the acuvity of GCREC. For example, modulation 

may cause an increase or a decrease in protein acuvity, binding characterisUcs, or any other biological, 
functional, or immunological properties of GCREC. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nuclcoude, oligonucleotide, 
po,ynucle<,ude, or any fragment thereof. These phrases also refer to DN A or RN A of genon.c or 
.0 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PN A), or to any DN A-like or RN A-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
aincuonal relaUonship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcripuon or expression ol the ccxling 
25 sequence. Operably Unked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the sanie reading frame. 

' Peptide nucleic acid" (PNA) refers to an anUsense molecule or anU-gene agent which 
comprises an oUeonucleotide of at least about 5 nucleotides in length linked to a peptide backbone o. 
amino acid residues ending in lysine. The terminal lysine conlers solubility to the composiUon. PN As 
30 preferentially bind complementary single stranded DNA <,r RN A and stop trai^cript elongation, and 
may be peaylaied to extend their lifespan in the cell. 

- Post-translational modincation" of an GCREC may involve l.pidation, glycosylation. 
phosphorylation, acetylation, racemi.ation. proteolytic cleavage, and other modifications known in the 
art These processes may occur synthetically or b.ochen.cally. Biochemical mcxlifications will vary by 
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cell t\pe depending on ihe enzymatic niilieu of GCREC. 

"Probe" rctcrs to nucleic acid sequences encoding GCREC, iheir complements, or fragments 
thercol, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleoudes attached lo a detectable label or reporter molecule. Typical 

5 labels include radioacUvc isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 

short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplillcaUon (and idenufication) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

10 Probes and primers as used in the present invention typically comprise at least 15 contiguous 

nucleotides of a known sequence. In order lo enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, HX), 
or ai least 150 consecuuve nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supp<.)rted by the 

1? specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. el al. (1989) Molecular Cloning: A Laboratory Manual , 2""^ ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, P.M. et al. (1987) Current Prot(x:ols in Molecular 
Biology . Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 

20 ProtcKols, A Guide to Methods and Applications , Academic Press, San Diego CA. PCR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Ohgonucleotides for use as primes are selected using software known in the art for such 
25 purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
30 West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequencer 
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a ^^mispriming library,'^ in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of ohgonucleotides for 
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,n,cn.arTays (The s(,urce ccxle lor the latter two pr.mer sdecdon programs may also be obtained from 
the>r rcpccuve sources and modified to meet the user's specUlc needs.) The PruncGen program 
(available to the publ.c from the UK Human Genome Mappmg Project Resource Centre, Cambridge 
UK) das.uns primers based on mult.ple sequence alignments, thereby allowmg selection of prmiers that 

5 hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence this program is useful for identificaUon of both unique and coaserved oligonucleotides and 
polynucleotide fraements. The oligonucleotides and polynucleotide fragments idenufied by any of the 
above selection methods are usehil in hybridizaaon ta:hnolog.es. for example, as PCR or sequencing 
primers microarray elements, or specific probes to identify hilly or partially complementary 

,0 polynucleoudes in a sample of nucleic acids. Methods of oligonucleotide selection are not hmited to 
those described above. 

A - recombinant nucleic acid " is a sequence that is not naturally occurring or has a sequence 
that is made by an aruficial combination of two or more otherwise separated segments of sequence. 
This artificial comb.nauon is often accomplished by chemical synthesis or. more commonly, by the 
,5 artificial manipulation of isolated segments of nucleic acids, e.g.. by genetic engineering techniques 
such as those described in Sambrook, The term recombinant includes nucleic acids that have 

been altered solely by addition, subsUtution, or dcleUon of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence, 
such a recombinant nucleic acd may be part of a vector that is used, for example, to transform a cell. 
.0 Al.ernauvely. such recombinant nucleic acids may be part of a viral vector. e.g., based on a 

vaccima virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A -regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
reeions of a eene and includes enhancers, promoters, introns. and 5' and 3' untranslated regions (UTRs). 
Re^gulatory elements interact with host or viral proteins which control transcription, translation, or RNA 



25 

Stability. 



Reporter molecules" are chemical or bux.hem.cal moieue.s used for labeling a nucleic acid, 
ammo acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chcmiluminescent. or chromogenic agents; substrates; cofactors; inhibitors; magneUc paruc.es; and 

30 (Hher moieties known in the art. 

An -RNA equivalem." in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all CKcurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of nbose 
instead of deoxyribosc. 
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The term "sample" is used in its broadest sense. A sample suspected of containine GCREC, 
nucleic acids encoding GCREC, or fragments thereof may comprise a txxlily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell: a cell, genomic DNA, RNA, or cDNA, 
in solution or bound to a substrate; a tissue; a tissue print; etc. 
5 The terms "specific binding" and "specifically binding" refer to that intcracuon between a 

protein or peptide and an agonist, an antibody, an aniagonisi, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 
10 A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount ol labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
15 they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
20 microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound, 

A "traascript image" refers to the collective pauern of gene expression by a particular cell type 
or tissue under given conditions at a given time. 

"Transformation" descTibes a process by which exogenous DNA is introduced into a recipient 
25 cell. Transformation may occur under natural or artificial conditions according to various methods well 
known in the an, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryoiic or eukaryotic host cell. The method for transformation is selected based on the type 
of host cell being traasformed and may include, but is not limited to, bacteriophage or viral iniecuon. 
cleclroporation, heal shock, lip(^fection, and particle bombardment. The term "transformed cells" 
30 includes stably transformed cells in which the laserted DNA is capable of replication either as an 

auu^nomously replicating plasmid or as part of the host chromosome, as well as iraasiently traasforme^l 
cells which express the inserted DNA or RNA for limited pericxis of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
ammals and plants, in wfiich one or more of the cells of the organism contains heterologous nucleic 

23 
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acid mu-oduced by way of human intcrvenUon. such as by transgenic techniques well known in the 
an. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
01 the cell, by way of deliberate gcneUc manipulation, such as by microinjccuon or by mfecuon with 
a recombinant virus. The term geneuc manipulation docs not include classical cross-brccding, or m 

5 vitxo fertihzauon, but rather is directed to the mtroducUon of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria. fungi, planus and ammals. TTie isolated DNA of the present invenuon can be 
introduced into the host by methods known in the art. for example inlecuon, transfection, 
transformation or transconjugation. Tcchmques for transferring the DNA of the present invention 

10 into such organisms are widely known and provided in references such as Sambrook el al. ( 1 9K9), 
supra . 

A -variant- of a particular nucleic acid sequence is dclined as a nucleic acid sequence having at 
least 4()% sequence idenUty to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
15 set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%. at least 80%, at least 85%. at least 90%. at least 95% or at least 98% or greater 
sequence idenUty over a certain defined length. A variant may be described as. for example, an '-allelic" 
(as defined above), "'splice," "spec.es." or 'polymorphic" variant. A splice variant may have sigmlicant 
idenUty to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
20 due to alternauve splicing of exons during niRN A processing. The corresponding polypeptide may 
possess addiuonal functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulung 
polypepUdcs will generally have significant amino acid idenUty relaUve to each other. A polymorphic 
variant is a variaUon in the polynucleoUde sequence of a particular gene between individuals of a given 
25 species. Polymorphic variants also may encompass "single nucleoUdc polymorphisms" (SNPs) in 
which the polynucleoUde sequence varies by one nucleoUde base. The presence of SNPs may be 
indicative of, for example, a certain populaUon. a disease state, or a propensity for a disease state. 

A - variant" of a parUcular polypcpUde sequence is defined as a polypepUde sequence having al 
least 40% sequence idenUty to the particular polypeptide sequence over a certain length ol one of the 
30 polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 ( May-07- 1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%. at least 90%. at least 95%. or at least 98% or greater sequence 
idenUty over a certain defined length of one of the polypepUdes. 
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THE INVENTION 

The invenuon is based on the discovery oi new human G-protein coupled receptors (GCREC), 
the polynucleoudcs cnccxling GCREC, and the use of these coniposiuons lor the diagnosis, treatment, or 
prevention ol cell proliferative, neurological, cardiovascular, gastromtestinal, 

5 autoimniune/inflammatory, and metabolic disorders, and viral infections. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a 
single Inc>te project idcntificauon number (Incyie Project ID). Each polypeptide sequence is denoted 
by bcHh a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incytc 

10 polypeptide sequence number (Incyte Polypeptide ID) as shown. Each p<:>lynucleoUdc sequence is 

denoted by both a polynucleotide sequence idcnufication number (Polynucleotide SEQ ID NO:) and an 
Incytc polynucleotide coasensus sequence number (Incyie Polynucleotide ID) as shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 

15 polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyie Polypeptide ID) for polyp>eptides of the invention. Column 3 
shows the GenBank identification number (Genbank ID NO:) of the nearest GenBank homolog. 
Column 4 shows the probability score for the match between each polypeptide and its GenBank 
homolog. Column 5 shows the annotauon of the GenBank homolog along with relevant citations where 

20 applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invenUon. Columns 1 and 2 
show the polypeptide sequence identificaUon number (SEQ ID NO:) and the corresponding Incyte 
polypepude sequence number (Incyte Polypepfide ID) for each polypeptide of the invenuon. Column 3 
shows the number of amino acid residues in each polypeptide. Column 4 shows potential 

25 phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS 
program of the GCG sequence analysis software package (Genetics Computer Group, Madison WI). 
Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Column 7 
shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods were applied. 

30 As shown in Table 4, the full length polynucleoUde sequences of the present invenuon were 

assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 
combinaUon of these two types of sequences. Colunuis 1 and 2 list the polynucleotide sequence 
idcnufication number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invenuon. 
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Column 3 shows the length of each polynucleoude sequence ,n basepa.rs. Colunm 4 l.sts fragments of 
the polynucleoude sequences which are useful, for example, in hybnd.zauon or ampl.tlcaUon 
technologies that identify SEQ ID NO:22-42 or that disuneu.sh between SEQ ID NO:22-42 and 
related polynucleoude sequences. Colunin 5 shows .denutlcaUon numbers corresp<.nd.ng to cDNA 
5 sequence., coding sequences (exons) predicted from genetic DNA, and/or sequence assemblages 
comprised of both cDN A and genon^c DNA. These sequences were used to assemble the hiU length 
polynucleotide sequences of the invenUon. Columns 6 and 7 of Table 4 show the nucleotide start (5 ) 
and .stop ( 3') positions of the cDNA and genomic sequences in column 5 relaUve to their re.specuve hill 
lentith sequences. 

0 " The identificaUon numbers in Column 5 of Table 4 may refer .specifically, tor example, to 
mcyte cDN AS along with their corresponding cDNA libraries. For example. 5080262H 1 is the 
.dentificauon number of an Incyte cDNA sequence, and LNODNOTl 1 is the cDNA library from which 
i, is derived. Incyte cDNAs for which cDN A libraries are not indicated were derived from pooled 
CDNA libraries (e.g.. SBSA02572V1). Alternatively, the identification numbers in column 5 may refer 
,5 to GenBank cDNAs or ESTs (e.g., g4589483_CD) which contributed to the assembly of the full length 
polyt^ucleotide sequences. Alternatively, the idenUficauon numbers in column 5 may refer to c.xl.ng 
regions predicted by Gcnscan analysis of genomic DNA. For example. GNN.g5902227_03().edit is the 
.dentification number of a Genscan-predicted coding sequence, with g5902227 being the GenBan. 
identification number of the sequence to which Genscan was applied. The Genscan-predicted coding 
.0 sequences may have been edited prior to assembly. (See Example IV.) AlternaUvely, the identificauon 
numbers in column 5 may refer to assemblages of both cDNA and Genscan-predicted exons brought 
together by an "exon stitching" algorithm. (See Example V,) Alternatively, the idenufication numbers 
m column 5 may refer to assemblages of both cDN A and Genscan-predicted exons brought together by 
an -exon-stretching" algorithm. (See Example V.) In some cases. Incyte cDNA coverage redundant 
25 With the sequence coverage shown in column 5 was obtained to confirm the final consensus 
polynucleotide sequence, but the relevant Incyte cDN A idenufication numbers are not shown. 

Table 5 shows the representaUve cDN A l.branes for those full length polynucleotide sequences 
which were assembled us.ng Incyte cDN A sequences. The representative cDNA library iS the Incyte 
CDNA library which is most frequently represented by the Incyte cDN A sequences which were used to 
30 assemble and confirm the above polynucleotide sequences. The tissues and vecu.s which were used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses GCREC variants. A preferred GCREC variant is one which 
has at least about 80%. or alternatively at least about 90%. or even at least about 95% amino ac.d 
sequence idenuty to the GCREC amino ac.d sequence, and which contains at least one functional or 
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Structural charactensUc oi GCREC. 

The invenuon also encompasses polynuc]et:)Udes which encode GCREC. In a particular 
embodiment, the invenuon encompasses a polynucleoude sequence comprising a sequence selected trom 
the group consisting of SEQ ID NO:22-42, which encodes GCREC. The polynucleotide sequences of 

5 SEQ ID NO:22-42, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein cxxurrcnces of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence enccxiing GCREC. In 
particular, such a variant polynucleotide sequence will have al least about 70%, or alternatively at least 

10 about 85%, or even ai least about 95% polynucleotide sequence identity to the polynucleotide sequence 
enccxiing GCREC. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group coasisting of SEQ ID NO:22-42 which has at 
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:22-42. 

15 Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contaias at least one functional or structural characteristic of GCREC. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of polynucleotide sequences encoding GCREC, some bearing minimal similarity to 
the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 

20 invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations arc made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 
occurring GCREC, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode GCREC and its variants are generally capable of 

25 hybridizing to the nucleotide sequence of the naturally occurring GCREC under appropriately selected 
conditions of stringency, it may be advantage(^us to produce nucleotide sequences encoding GCREC or 
its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
(x:curring codons. C<xlons may be selected to increase the rale at which expression of the peptide 
(xcurs in a particular prokaryotic or eukaryoiic host in accordance with the frequency with which 

30 particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence enccxiing GCREC and its derivatives without altering the encoded amino acid sequences 
include the pnxiuction of RNA traascripts having more desirable properties, such as a greater half-life, 
than transcripts produced from the naturally (x;curring sequence. 

The invention also encompasses production of DNA sequences which encode GCREC and 
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sequence and e.p,oy,ng varrous PCR*a n, ^^^^^ ^^^^^^ 

— oa.p,,.u^^^ 

restncuon-site PCK, uses univcisai a , . ,^ T 

,c«. p ^ Sarkar G (1993) PGR Melhtxis Appl.c. 2.318-322.) 
ONA wiOun a c,„n.„g ve«„^ (S«, - ^ - J „ „„.„„w„ 

,„ Auomcr u,er.-, "^- '"^^IZ^ -e.p,are ,s derived ,r«n resrneUon ,ragn.n.s co.pr,s,ng a 
sequence from a e,rcu,ar,zed,en,p,a>e, Thc,c^ T„,„a, T. e. a,. < 19W NuCac Acd, 

^"T"r:rR :r>^R an,;„r,canon o, PNA ,rag,ne„.s ad,ace„, 
Res. ,6:8,86.) A third n«.h.^, capture PGR, mvCves P 
,0 ,.„w„ setiue^^ .n huntan and yeast arttfca, chronosome DN A (See, e.g., _ 
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(1991) PCR Methods Applic. 1:111-1 19.) In this mclhixl, niuluple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other meth^xls which may be used to retrieve unknown sequences 
are known m the art. (See, e.g., Parker. J D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 
5 Addilionally, one may use PCR, nested prmiers, and PROMOTERFINDER hbraries (Ciontcch, Palo 
AJto CA) to walk genomic DNA. This prcx:edure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 primer analysis software (National Biosciences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
10 GC content of about 50% or more, and to anneal to the template at temp>eratures of about 68''C to 
72"C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 

15 docs not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are conrunercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR prcxJucts. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 

20 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the enure 
process from loading of samples to computer analysis and electronic data display may be computer 
cc^niroUcd. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 

25 which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments tfiereof which 
encode GCREC may be cloned in recombinant DNA molecules that direct expression of GCREC, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 
the gcneuc code, other DNA sequences which encode substantially the same or a functionally equivalent 

30 amino acid sequence may be produced and used to express GCREC. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter GCREC-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, pr(x:essing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
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■"•r::=: rr ^^^^ 

, Q ,n,^ riara CA- described in U.S. Patent Number 
.sMOLECULARBREEDlNG(M.xygen Inc.. Santa Clara CA,desc 

5 X37 458- Chang C -C. et aL (1999) Nat. B.otechnol. 17:793-797; Chnsuans, F.C. et al. ( .999) 
5,837,458, Chang. C. Biotechnol. 14:315-319) to alter or 

Bu>tecbnol. 17:259-264; and ^ ^^^^^^^^ ,,UvUy or .ts abUUy 

..,ove btolo^cal prope.cs or ^^^^^^^^^^^^^^ / ^^^^^ , ,ene 

,o bind to other molecules or compounds. DN A shulHrn, 

H usine PCR-mediated recombinaUon of gene Iragments. The library 

^ no rrREC may be synthesized, in whole or in part, 
,„ ,„„^ e,„^™e„,. ...e.. c^o.. OCREC ^_ ^^^^^^^ 

r K,„„ T ( 1984) rr--' ' .,nd Mol ^'ilar WH Fr=<man. 

:::c z . .-.^ .... .^^^^ 

^0 seuuencc ot a naturally occurring polypeptide. 

" " T.epep.demaybesubstanuallypur.n.bypreparauve.ghpc_ 

. ranhv (Sec c g Chiez, R.M. and F.Z. Regmer (1990) Methods Enzymol. 182.39_ 4.1 .) 
chromatography. (Sec. eg , C ^^^^ ,,^-,n.ed by amino acid analysis or by sequencing. 

The composition ol' the synthetic peptides may be conlirmea oy 
(Sec, e.g.. Creighton, supra , pp. 28-53.) 
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In order to express a biologically active GCREC, the nucleoude sequences encoding GCREC 
or denvauves thereof may he inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements tor transcriptional and iraaslauonal control ol the inserted coding 
sequence m a suitable host. Tlicse elements include regulatory sequences, such as enhancers, 
5 constiiuiive and inducible promoters, and 5' and 3' untranslated regioas in the vector and in 

polynucleotide sequences encoding GCREC. Such elements may vary in their strength and specitlciiy. 
Specific imtiauon signals may also be used to achieve more etticient translation of sequences encoding 
GCREC, Such signals include the ATG iniLiation codon and adjacent sequences, e.g. the Kozak 
sequence. In cases where sequences enccxling GCREC and its initiaiion ccxlon and upstream regulatory 

10 sequences are inserted into the appropriate expression vector, no additional transcriptional or 

translational control signals may be needed. However, in cases where only coding sequence, or a 
fragment thereof, is inserted, exogenous translational control signals including an in-frame ATG 
imtiauon codon should be provided by the vector. Exogenous translational elements and initiation 
codons may be of various origins, both natural and synthetic. The efficiency of expression may be 

1 5 enhanced by the inclusion of enhancers appropriate for the particular host cell system used, (See, e.g., 
Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 

Meth(xLs which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding GCREC and appropriate traascriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 

20 and in vivo genetic recombination. (See. e.g., Sambrook, J. et al. (1989) Molecular Cloning, A 

Laboratory Manual , Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, P.M. et 
al. (1995) Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
25 enccxling GCREC. These include, hut are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors: insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., caulillower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
30 animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Thcr. 7:1937-1945; Takamatsu, N. (1987) EMBO 
J. 6:307-311; The McGraw Hill Yearbook of Science and Technology ( 1992) McGraw Hill, New 
York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and 
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(1994) Mol. Immunol. 3l(3).^i^ z^o, ^mu 

.,.,o„„. .nc propa,.Uo„ or polynuCeoUde se.ue^ c„co>.n. CCREC can 

ac PRl UESCRIPT (Stratagene, La Jolla CA) or r:>r^^rv h 

-.^^ scA^ '^'lOQ^ When lari;equanuues ol GCRfcL arc neeucu, ^ f 
264:5503-5509.) When la q ^^^^^^^ ^^^^^^ 

vectors which direct high level expression ot GCREC may oe 

.e strong, SP6 or T7 hac«e promoter may ^c us^^ ^^^^^^ 

Yeast expression systems may be used for producuon of GCREC. A num 

,^r. «,.rh as alDha factor, alcohol oxidase, and POH 
contalmng co^sU.uUve or Inducible p-omotcs. such as alpha ^^^^ 

p,„.o„s. .na, be used In ^ yea. S.«=....~ « "t^and enable ,n,e„a.on 
Lo-s diiec. eun. .ec.eUon „ l^racel.la. .e.enUo„ o, e.p,e^^^ P^e,-^ ^ 
„, ,„,e,.„se.uenc»,n.o«.»s, genome fo,s.blep,op..au„n^(S«,e..,Au»-^^ 

B,„c. O A. e. al. <,987> MeUK>l. Enzymol^ .53.516.544; a«l Scorer, C.A. al. (199 ) 

B,ofTectinologyl2,l«l-184.) „„,„gcrEC Transcription of sequences 

Plan, s,s,e.s .ay also be ns. for cxp,«s.o r CCREC. ^ _ ^^^^ 

cncr^ng GCREC niay be driven b, vrral promoiers. e,g me J ^^^^ , 

„„„eor ,„co„,binari„n w,.U»o.e,a lea*r se^neace '^^-J^^^—^^l^,,^, 

(1984, science 224.838-843, ,,,,sformauon or pathogen-med.atcd 

constructs can be mtroduced into plant cells by direct DNA 

..nsfeCon. <See, e.g., mM^QO^imV.^^ 
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New York NY. pp. 191-196.) 

In mammalian cells, a number ol viral-bascd expression systems may he utJIized. In cases 
where an adenovirus is used as an expression vector, sequences enccxiing GCREC may he ligaied mto 
an adenovirus transcription/translation complex consislmg of the late promoter and tripartite leader 
5 sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to c^btain 

intective virus which expresses GCREC in host cells. (See, e.g., Logan* J. and T. Shenk (1984) Proc, 
Natl. Acad. Sci. USA 81 :3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to incTease expression in mammalian host cells. SV4() or EBV- 
hascd vectors may also be used for high-level protein expression. 
10 Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

DN A than can be contained in and expressed from a plasnud. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycalionic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J J. et al. (1997) Nat. Genet. 15:345-355.) 
For long term production of recombinant proteias in mammalian systems, stable expression of 
15 GCREC in cell lines is preferred. For example, sequences encoding GCREC can be u-aasformed into 
eel! lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
intrcxJuction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
20 selective agent, and its presence allows growth and recovery of cells which successfully express the 
mtroduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 
but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
25 genes, for use in /A:* and ^pr cells, resp>ectively. (See, e.g., Wigler, M. et al. (1977) Cell 1 1:223-232; 
Lowy, 1. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pal confer resistance to 
chlorsulfuron and phosphinotricin acetyltransfcrase, respectively. (See, e.g., Wigler, M. et al. (1980) 
30 Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbcre-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and insD, which alter cellular requirements 
for metabcMites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Pr(x:. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green lluorescent proteias (GFP; Clontcch), B 
glucuronidase and its substrate I5-glucuromde, or luciferase and its substrate luciferin may be used. 

33 
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enc^n, OCREC .s ,.e„e. w„«„ . 7„1„. „„„.o.a. a 

4 r-r-PFr can be identified by the absence of marker gene mui-u 
sequences encoding GCREC can be ^ 

specihc ixilyclon radioimmunoassays (RlA-s), and tluorescence 

acuv^ied cell sorung (FACS). A mos. ,„mpe„Uve bWng 

20 .mlbcdics reactive lo ,w„ non-lmerlerlng epitopes on GCREC pr te . ^ 

a... .a, .e en,plo,e.^ T.ese ana o,^ a.a. ^ Z. Col.an. I B. 

" '^-^^^ Wileylnlerscle^. Ne. 

....„el.ona..a.co*.aon..«.e«^ 
n.av be used in various nucleic acid and amino acid assays. Mean P 
or PGR probes .or detecting sequences related to po.ynuc.eoUdes encc^ng GCREC inCud^ 

, .nn ..belin. or PCR ampl.ticaUon using a labeled nuclcoude. 
olieolabcling, nick translation, cnd-label.nfe, or PCK p 

li^arrRFC oranYtracmentsihcrcol.maybecioncaimua 
Alicrnaiively. the sequences encoding GCREC. or any . commercially available, 

- =:r— ^=^^^^^^^^ 

-h ...T7 T3 or SP6 and labeled nucleoudes. These procedures n y 
.Maoison W.). a„a .S B,oc..„.cal. Su.laMe ,ep»,e, „K„«u,es c laPels .^c. n«> « u.«i 
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ot detection include radionuclides, enzymes. Huorescent, chenuluminesccnt, or chromogcruc agents, as 
well as substrates, cofaciors, inhibitors, magnetic particles, and ihe like. 

Host cells transtormed with nucleotide sequences encxxling GCREC may be cultured under 
ct)ndiuons suitable lor the expression and recovery ot the protem from cell culture. The protein 
5 produced by a transformed cell may be secreted or retained iniracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
p<^lynucleoudes which encode GCREC may be designed to contain signal sequences which direct 
secretion of GCREC through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression ol the 

10 inscned sequences or to process the expressed protein in the desired fashion. Such mcxlifications ot the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylauon, 
lipidadon, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also tx^ used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 

15 (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 

Collection ( ATCC, Manassas VA) and may be chosen to ensure the correct modilicauon and processing 
of the foreign protein. 

In anothCT embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding GCREC may be ligated to a heterologous sequence resulting in translation of a 

20 fusion protein in any of tf>e aforementioned host systems. For example, a chimeric GCREC protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of GCREC activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 

25 binding protein (MBP), thic^-edoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-mvc, and 
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutatfuone, maltose, phenylarsinc oxide, calmodulin, and metal-chelaie resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 

30 these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 

located between the GCREC encoding sequence and the heterologous protein sequence, so that GCREC 
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra , ch, 10). A variety of conuiicrcially 
available kits may also be used to facilitate expression and purification of fusion proteins. 
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,„ . ,„,U,e, en„x«, o, >he ,nve„,lo„. syn.hes,. ,„ ,aa,o,aWed CCREC may ,« »h,evc<. 
v,„o „sl„, ,« TNT ,*.U «lcu,>Ky,. « w... g^m c.„ac, sys.cn, <Pr.,.e,a,. 
;;;,e ,rans..p„o„ an. .anslaUon of p,o,e,n.c«.ns s^ucnccs operab,, as««a,ed w„. the T7 T3, „ 
SP6 p,„n,o,e,s T,ans.a„o„ ,akcs p,ac= ,„ .he prccncc o. a ,a«abe>cd anuno acd precursor, ,or 

example, ''S-meihionine. 

OCREC 0, mc prescn, m.enUon „r .r.gn,en« may be used .o screen ,.,r co„,p.,u„ds 

,ba, specncw bind .o GCREC. A. ,eas, one and up ,0 a pluralKy o, ,es. compounds n,ay be 
screened lor specific binding ,o GCREC. Examples o, .es, compounds indude a„.ib.^,e,s, 
.,ll„onucleoUd=.s. proieins (eg, receplors). or small molecules. ,,,,,^„t 

' ,n one emb.Klime„,. ibc compound ,bus IdenUf.ed is closely -elared to ,bc na.ura, l.gand 
OCRHC. c.g . a ligand or Iragmcm rbcreol. a narural subsuate. a slr^cural or funcUonal numcuc or a 

i. 6 ,,.,.|,. rurri-nl pr—- "i- Inimunology U2). 

naiural bmding partner. (See. e.g.. Coligan. J.E. et al. (1991) QmcnLCI — r7:„„ 

Chap.er 5 , Similarly, Uie comround can be closely relaied .0 .he naiural recepior ,o wb.cb GCREC 
bmds. or ,0 a, least a Iragmem o, the receptor, e.g.. tbe ligand binding site. ,n either case, the 
, compound can be raUonally designed using .nown tecbni,ues. In one em«nt. screemng^r 
les compounds involves producing approp.ate cells which express CCREC. cther as a seer ted 
prl i or on the cell membrane. Prelened cells include cells from mammals, yeast, o E 

cells e^pressmg GCREC o, cell membrane .raeUons .bleb 
;;b . test compound and binding, sUmulaUon. or InMbitton of acUvity o, either GCREC 

" """":':::::s,mply,es,bindi„gofatestcompou„dtothepol,peptide,.here,nb.„d.ng,s 
detec,edh,a.luorc.h„re,radioisotope,en.ymeconlugate. or other detectablelab^^^^^ 
U,e assay may comprise the steps of comb,mng at least one test compound with CCREC, CKhcr 
soluuon or affixed to a solid suppor. and detecUng the binding of GCREC to the com^und. 

,5 Alterna..vely, the assay may detect or measure binding o, a test compound in the presence of a 

2cd c„mU«^ A«..ona,ly, .e assay may be carHed out using cell-ree preparaUons, cbemic 
iLes, or natural product ituxtu.es, and .e test compounds) may be free ,n soluUon or atftxed ,o 

" gTrEC or tbc present mvenUon or fragmems ihereo, may ^ used to screen for compounds 
,„ tbat modulate the acUvity of GCREC. Such compounds may include agomsts. amagomsts. or pan.a. 
or inverse a.m.sts. Inone embtxhrnem. an assay .s perform^ under c„n<«U„ns permissive ,o 
OCREC acuvitv. wherein GCREC is combined with at least one test c„mpc«.n4 and the acuvity o, 
CCREC .n the presence of a test compound iS con^.r«. wtb the activity of GCREC in the absence ol 

• fr-r-ppr in The presence ot lhe test compound IS 
the test compound. A change in the acuvuy of GCREC in the presen 

* f r-rPFr Aliernativelv, a test compound is 
mdicauve of a compound thai modulates the actwtty of GCREC. Alternau >, 
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combined with an in vitro or cell-free system comprising GCREC under conditjons suitable tor GCREC 
activity, and the assay is performed. In either of these assays, a test compound which modulates the 
acuvity of GCREC may do so indirectly and need not c(>me in direct contact with the test compound. 
At least one and up to a plurality of lest compounds may be screened. 

5 In another embodiment, polynucleotides encoding GCREC or their mammalian homologs may 

be "kn(x:ked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such techniques are well known in the art and are useful for the generation of animal models of 
human disease. (See, e.g., U.S. Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo 

10 and grown in culture. The ES cells are transformed with a vector containing the gene of interest 

disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capccchi, M.R. (19H9) 
Science 244: 1 288- 1 292). The vector integrates into the corresponding region of the host genome by 
homologous recombination. Alternatively, homologous recombination lakes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specitic manner (Manh, J.D. 

15 (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and microinjecied into mouse cell blastocysts such as those from 
the C57BLy6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the 
resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous strains. 
Transgenic animals thus gena-aied may be tested with potential therapeutic or toxic agents. 

20 Polynucleotides enccxling GCREC may also be manipulated in vitro in ES cells derived from 

human blastocysts. Human ES cells have the potential to diOereniiate into at least eight separate cell 
lineages including endodcrm, mesixlerm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J. A, et al. ( 1998) 
Science 282:1145-1 147). 

25 Polynucleotides encoding GCREC can also be used to create "knockin" humanized animals 

(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding GCREC is injected into animal ES ceils, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulac, and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 

30 potential pharmaceutical agents to obtain inibrmation on treatment of a human disease. Alternatively, a 
mammal inbred to ovcrexpress GCREC, e.g., by secreting GCREC in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
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1,. a. ™a, ,„,«,„n. , = .ea,p,o„, „■ .».c. -—2:—" ,^ 

CCREC express,o„ o, ac.vHv. n . Ces.raMc .o decease .eexp,ess,on o, -vn, o, GCREC. „ 
l„e. o'.so..c. assoc,a,e. wO, a^rease. OC^C ex.c.s,„n o, ac.v„„ „ ,s *s„..e 
increase the expression or acuviiy of GCREC. 

^e-eL ,n one en,^in,en.. GCREC or a .ra,.e„, or .e.vaUve .ereo, .av .c 
aan,..s,ere. ,0 a s.«« ,0 uca, o, preven. a .so*r .ss...,e. w,. *crease. 

, rCREC Examples 01 such disorders ..Klude. but are no. limited ,o, a cell proh.erauvc 

. :rer::aL~^^^ 

eonnecuvc ussue disease (MCTD), n.ydofi^ros.s, paroxys.a. nocturnal heniog.o.nuna, 

K era osonasis pnmary thrombocythemia. and cancers mc.udmg adenocarcinoma, 
polycythemia vera, psoriasis, pnm y and. in particular, cancers of 

leukemia, lymphoma, melanoma, myeloma, sarcoma, teraioc 

IS saslroinlestinal tract, heart, taaney, iivci, m 5, , ^. . ^ e„,.h 

Llvary .la^ds, s.n. spleen, lesUs. mymus, d,yro,d, and «™s. a „e„rolo.ea, d— ^ a 
, Tv ischennc cerebrovascular disease, sToke, cerebral neoplasms, Alzbcme, s disease. 

30 rcunius pigmentosa, beredKary araxias, mnlup.e ^l""- ,„^^^„^^, 
ana vlra, memng.ris, brain abscess, subdural empyema, ep.dural abscess supp 
«pb,ebids. myeuus and radlcullbs. virai central nervous system d,.,ease, P™ 
I^udin Kum Creut.feld.-Ja.ob disease, and Cerstmann-Suaussier-Scbetnlcer syndrome, raial 

lal nsomnia, nu,ri.,o„a, and me,abo,ic disease., o, .be nervous system. 
. : s sclerosts. cerebelloreUna. bemang,oblastom..t,s,s. -P-«r,gen.n. .n^ro^^^^ 
rcardauon and o.ber developmema, disorders o, .be cernr. nervous system.^* P-^ 

===== 

^^^^ 

rrr:===— ^^^^ 
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tumors, complicaUons ol thrombolysis, balkx)n angioplasty, vascular rcplacemeni, and coronary 
ancry bypass grali surgery, congestive heart failure, ischemic heart disease, angina pectoris, 
myocardial mtarciion, hypertensive hean disease, degenerative valvular heart disease, calcihc aortic 
valve stenosis, congemtally bicuspid aortic valve, mitral annular calcification, mitral valve prolapse, 

5 rheumauc fever and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic 

endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, 
myocarditis, pericarditis, neoplastic hean disease, congenital heart disease, and complications of 
cardiac transplantation; a gasu-ointestinal disorder such as dysphagia, peptic esophagitis, esophageal 
spasm, esophageal stricture, esophageal carcinoma, dyspepsia, indigestion, gastritis, gastric 

10 carcinoma, anorexia, nausea, emcsis, gastroparesis, antral or pyloric edema, abdominal angina, 
pyrosis, gastroenteritis, intestinal obstruction, infections of the intestinal tract, peptic ulcer, 
cholelithiasis, cholecystitis, cholestasis, pancreatitis, pancreatic carcinoma, biliary tract disease, 
hepatitis, hyperbilirubinemia, cirrhosis, passive congestion of the liver, hepatoma, infectious colitis, 
ulcerative colitis, ulcerative proctitis, Crohn's disease, Whipple's disease, Mallory-Weiss syndrome, 

15 colonic carcinoma, colonic obstmciion, irritable bowel syndrome, short bowel syndrome, diarrhea, 

constipation, gastrointestinal hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropathy, 
jaundice, hepatic encephalopathy, hepatorenal syndrome, hepatic steatosis, hemochromatosis, 
Wilson's disease, alpha,-antitrypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, 
liver infarction, portal vein obstruction and thrombosis, centrilobular necrosis, peliosis hepatis, 

20 hepatic vein thromt>osis, veno-occlusive disease, preeclampsia, eclampsia, acute fatty liver of 

pregnancy, intrahepatic cholestasis of pregnancy, and hepatic tumors including ncxlular hyperplasias, 
adenomas, and carcinomas; an autoimmune/inflammatory disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory disu-ess syndrome, 
allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 

25 hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodcrmal 
dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus, emphysema, episexlic lymphopenia with lymphocytotoxins, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 
syndrome, gout. Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel 

30 syndrome, multiple sclerosis, myasthenia gravis, myocardial or pericardial inllammation, 

osteoartftrilis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumat(^id 
arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, 
systemic sclerosis, thrombocytopenic purpura, ulcerauve colitis, uveius, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 

35 parasitic, protozoal, and helminthic infections, and trauma; a metabolic disorder such as diabetes, 
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■ .nd an intecuon by a viral agent classU.ed as adenovirus, arenavirus, 
obcsiiy. and osteoporosis, and an intccUon y ^,,^,,v,rus llav, virus, orthomyxovirus, 

. ^^^^^ 

ti-.i porrier mav be administered to a subject to treai u v 

aon^msicred ,o a s„bj«. u, fea. or p-evem a J,sor*r assoc.a 
,„ a rurto e„,bodimem. a„ amagon,s, of GCREC ™y 

^^^^ 

.astrointcsuna,. autoimmune/tnllammatory, ^^^^^^ 
^ n. GCREC ntay be administered to a subject to treat or prevent a disora 

::ra:ri:o,ac..o.oc.H^ 

...uences, o, .mors .,r ,hc Invcnuon nv.y adnun, e «^ ^^^^ ^ 

„, r,«,apcuuc ag«s nuy ac, s,„.,8.s„.a„y ,0 ^ ^ ,^ ^^^.^^^^ ^.^ 

,0 <ss»*,s scribed ah..v. Usins ,ms approach, one n,a> be able 

'° ,„.e, .sa.es o, ea. a.... .us re^cn, ^enUa, ,or ,„ 

, n. rCREC may be produced using mcihcxls which are gene y 
.„ J^^^ ,^ ,„,^es or u, screen Ubr.i.s or 

a,,, ,„parncular.pur,l>edOCRECmayb ^ ,,,„,^oCREC Anybodies .o GCREC may 

pnar^accubca, a.en« .0 rdcnu,, .bosc wbicb sp«:.ficaUy b,nd 
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also be generated using methcxis that are well known in the art. Such anubodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single cham anubodies, Fab tragments, and 
fragments produced by a Fab expression library. NeutraUzmg anubodies (i.e., those which inhibit 
dimcr lormation) are generally prclerred tor therapeutic use. 

5 For the producuon of anubodies, various hosts including goals, rabbits, rats, mice, humans, 

and others may be immunized by injection with GCREC or with any fragment or oligopeptide thercol 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freunds, mineral gels 
such as aiuminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 

10 polyanions, pepudes, oil emulsions, KLH, and dinitrophcnol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

Ft is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
GCREC have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 

15 fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches of 
GCREC amino acids may be fused with those of another protein, such as KLH, and antibodies to the 
chimeric molecule may be produced. 

Monoclonal antibodies to GCREC may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 

20 to, the hybridoma technique, the human B-ceil hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985 ) J. 
Immunol. Methods 81:31-42; Cote, RJ. et al. (1983) Proc. Natl. Acad, Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol, 62:109-120.) 

In addition, techniques developed for the production of ' chimeric antibodies," such as the 

25 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 

antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad, Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques descrit>ed for the production of single 
chain antibodies may be adapted, using methods known in the art, to pr(xluce GCREC -specific single 

30 chain antibodies. Antibixlies with related specificity, but of distinct idiotypic composition, may be 

generated by chain shuffling from random combinatorial immunoglobulin Ubraries. (Sec, e.g., Burton, 
D R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.) 

Antih(xlies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
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Anubody iratmcn Ffabl Iraemenis produced by pepsin 

e^Me. ."S,ne„. ... n.. „„u,eO ,0, R^.J . P 

l.c„ur,ca.,on„fn,o.«c,ona,FaM,asn,=n«w.mU«de,si,=.spec,.,u.y.(S«,.*, 

, ,98» Sde«e 246;1275-1281.) „ i^nUly a„ub,.li« l-avmg ihe desired 

Various rmn-unoassays may be used for screerung y 

Kir^Hino nr immunoraciiomcinc assays usini^ tiui 

^^^^^^ 

a^Ubr^y. A ^..-based — .so « 

,„ ,.,„-inre„erins GCREC epUopes ,s generally used, but a co„^" 

c„,pl,>yed (Pound. suDta). „ ,„„,„™m1ou with radiolmmurMassay leelaiiques 

varrous .et^ sueb as ,y is e.pr»sed as a„ asso..atio. 

„,., ,„ used ,„ assess ,he atn^ty of an..bod.» fo, °'=^<^- ,he 
_„,,,,„.b,sde„„edas,be.o,— 

„,„,a, „„ceu„ado. of ^ ^^'^^ IL^^^^^s i„ d.r a,r,..,es for uruidple 
„ „„ a prcparaaon of polyclonal amibodres. „f , be anUbodie,, for GCREC. Tbe K. 

epif^es. rcprcsen. tbe ^^''^^^ ^ZZ. a,e r^.^sp^inc for a parfrcular 
*,cn.n». for a preparation o, —a a ub^. • ^ 
OCRECepito.— a — 

rangmg.romaboutlO lolO Um Low-affinity anUbody preparations w,th K. 

ranging Iron, about 10 .0 10 Umole P ^^^^ 
p,«cdures wbieb ulumately require dissipation o, ^^''^^^ " J „^,„„„„„ oc. 

,„ New York NY). anubody preparations may be furtber evaluated to deternune 

titer a^ a.i.t, m ^^^'^^J^.^^ .,p„ea.ons. exaiiiple. a 

« quality a^ suitabili.y of sucb ^'^"-^^^^^ ^„,^,„„, s-l g 

polyclonal anttbtxly preparation contaiiun. a, '"^ ' ^ ^= ^ ^ ^, ocREC-antib^y 

specific antibcdy/ml. is generally employed ,n procedures requiring p 
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complexes. Prcx:edures for evaluating anUlxxly sp>ecificity, uicr, and avidity, and guidelines lor 
anubody quality and usage in various applicalioas. are generally available. (See, e.g.. Catty, supra , and 
Coligan et al. supra .) 

in another emb(.xliment ot the invention, the polynucleotides encoding GCREC, or any tragment 
5 or complement thera)t\ may be used lor therapeutic purposes. In one aspect, mcxlificatioas of gene 

expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PN A, or modified oligonucleotides) to the ccxling or regulatory regions ol the gene encoding GCREC. 
Such technology is well known in the an, and antisense oligonucleoudes or larger fragments can be 
designed from various locations along the ccxling or control regions of sequences encoding GCREC. 

10 (See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be dehvered 
intracellulaxly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 

15 Slater, J.E. et al. (1998) J. Allergy Cli. Immunol. 102(3):469-475; and Scanlon, K.J. ei al. (1995) 

9( 1 3): 1 288- 1 296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as relrovims and adencvassociated virus vectors. (See, e.g.. Miller, A.D. (1990) Blood 
76:271; Ausubel, supra ; Uckerl, W. and W, Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene dehvery mechanisms include liposome-derived systems, artificial viral envelopes, and other 

20 systems laiown in the art, (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(1 1):1 308-1 315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding GCREC may be used for 
somatic or gernrUine gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 

25 (e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 

30 Therapy 6:643-666; Crystal, R.G. et al. ( 1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor Vlll or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
ceil proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
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such as human ,mn.unodefic.ency v.us (HIV) (Balumore, D. (1988) 
a.a.nsi human retroviruses, such as huma . sc, USA 93 1 1395-11399), 

u, c n996) Proc. Nail. Acad. SCI. Uirt. ^'J ' 

.cpa..s B « C v,™s ,HBV, HCV. u "^^^^^^ „^ Tnm«^)- ^ 
p™».o.n - ■ „ ^ cause. *«s. -P— 

GCREC irom an appropnaie population 

caused by the gcneuc deficiency. Hi.nr,lers caused by deficiencies in 

.„ . .unber embcxiiment ot the invention, diseases or disorders cau y 

. ^ hv constructing mammalian expression vectors encoding GCREC 
GCREC are treated by constructing ^^^^^.^^^ ^^^^^^^ 

-e v-rs by m— means ^^^^ ^^^^^ ecUs, <..) ballistic 

use with cells mvivo or eoiiio mclude (.) direct D ^^^^^^^^ 

r:::;::— ^^^^ 

' ma, be eft«.ve f« U» expression of GCREC ,»:.»*, bo, arc no> 

Expression vcoors ma. ,^,„„ (,„„„ogen, CarlsbaO CA), 

U„„ed ,0. >be PGDN A 3, , , EPITAG. PRCCMV2, PREP. 

PCMV-SCmPT. PCMV.TAG, PEGSH/PERV < ,,,,, express.. 

p^.ON, PTREZ PTRE2-LUC, PTK-HVO (Concecb. 
ns,n.O.— ,ae.epron,„«^^^^^ 

,,SV,. SV4,, v,ms. ■byn.a.ne fonase <TK. o,^ ac ^^^^ ^^^^ 

,c,rac,c„ne.resula.edpron«.r (Gossen, M. and H Bu^ard ) ^^^^ ^^^^^^ 

..1 nQ95^ Science 268:1766-1769, Rossi, r.ivi. 
89:5547-5551; Gossen, M. et al. (1995) (,nvitrogen)); the 

o ...hnni g 451-456), commercially available in theT KtA p 
Curr. Opin. B.otechnol. 9.45 l-^-o,, ^vrRXR and PlND: Invitrogen), the 

„ ™a.SEECT,ON .T, a,a„ab.e ,ro. ,n..o.e„, a,,,» one — 
po.„.,.Udcs . rar.e, ce„s ,n euKore and — "J J.^^ 

;rrE:BO:™^.^.r::,— no,O.A.opr,.ar,ee.^ 
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standardized mammalian iransfecuon prouxols. 

In another embodiment ot the mvenuon, diseases or disorders caused by eenetic delects with 
respect to GCREC expression arc treated by constructmg a retrovirus vector consisting ol (i) the 
polynucleoude encoding GCREC under the control ot an mdependent promoter or the retrovirus long 
s terminal repeat (LTR) promoter, (li) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) alone with additional retrovirus ct-y-acLing RNA sequences and coding sequences 
required lor efficient vector propagation. Retrovirus vectors (e.g.. PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, T. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herem. The vector is propagated in an 

n» appropriate vector producing cell line (VPCL) that expresses an envelop>e gene with a tropism tor 
rccepKus on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. ct al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M A. and 
A D Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Methcxl for obtaining 

15 retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant'') discloses a 
method for obtaining retrovirus packaging cell lines and is hereby incorporated by rclerence. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-cells), and the 
return ot transduced cells to a patient are procedures well known to persons skilled in the an of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 

20 al. (1997) BItxxl 89:2259-2267: Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. 
(1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Bkxxl 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding GCREC to cells which have one or more genetic abnormalities with respect to 
the expression of GCREC. The construction and packaging of adenovirus-based vectors are well 

25 known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. el al. (1995) Traasplantation 27:263-268). Potentially useful adenoviral vectors arc 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinoz/i, P. A. et al. (1999) Annu. 

30 Rev. Nutr. 19:511-544 and Verma, l.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding GCREC to target cells which have one or more genetic abnormalities with 
respect to the expression of GCREC. The use of herpes simplex virus (HS V)-based vectors may be 
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, ■ rrREC.o cells oliheccrareucrvous system, to «li.chHSV has a 
cspecall, valuable lor m./.*cmg GCREC ^.^^ 

s.11 ,„ « a,,. . .pl.a.,n-co„,pe« .c.p.s s,.p,» ^ 

pa.™ N„,„» 5,S«.,3 Deuuca P ,e use oUeco.b.nan, HSV 

hereby incorr.ira.ed by reterorKe, U S, Paieni i,j„„er,ed .o a cell under 

„h,cb consrsls ,„ a .e„o„. conral.n, a. leas, .,ne exogenous ,e„. b^u- 
,he con.ro, ol 0. approprl.re .on„.c P»r,«ses Inclu.n, bunra^^c^ .^.. ^ - 
,„ pa,enlare,beco_a.useo.— 

::er::.nera.on„,reco„b,na„.,rus,o,— ^^^^^ 

'" T " ^er allernarlvc an alphavlrus (p^rHve. slnsle-s.arKle. RNA vrrus, v«„r Is used ,o 
,„ anoiber allernabv. a P ^ ^^^^^^ ^ ^^^^^^ 

deliver polynucleoUdcs encoding OCREC urg ^^^^^ ^„ 

„ . r^Fvr has been sludled extensively and gene transler vectors na 
Senrlrlo Fores, V,rus<SFVO, has b^ „„„«Curr Opln Bio.ech™>l. 9:4M^6.,. During 
,he SFV genon« (Oaroff, H. and K -J. U (,99S, C ^^^^ ^ ^^^^^^ 

...rep,lcaaon. a^^ub^-^^^^^^^ 

protcns, TbissubgenomicRNArepbcaresto , „„„,ns wid, enzymauc activity (e.g , 

,n .eoverproducbonofcapsidprr-elasrelauvetotbe viral prote.^»,.ey^^ ^^^^^ 

pr.nease and poly-as* Si.-ly. ir^erUng U,e c««ng ^J^^^J^^^^^^^^ , 

.no„.,npl.ceot..capsld-cc.ngreg— 

c,x«nu RN AS and the synltos of h,gh levels o, OCREC ,n 

,s ,y.,c.,ly ass,.,a,ed »ltb cell lysis w„b,n a le. day... a^*. „ P^^^_^^ ^^^^ 

11.- ruUK with a variant oi ^inuui^ vuu. v 
t su., the neec. o, the gene therapy application 

...c rcpucatron 01 alphav.nrses al,o« the 

,„ toryga. S.A e. al, (1997, V,r*gy » ,^ „^„^,„„,,„ „, , of cells u, a 

introduction of OCREC into a variety o, cell types. The spec, 
p.,ula.,on„ayret,u,re-«s.«lngofce,,spr,or,o,ra— ^^-^^^ 
,„ccuous cDNA cones of alphavtruses. per,or,b.„g alphavirus 
perlornang alphav,™s ,n,ech„ns. are „el, .own ,o ,bose .ith or.nary s.,1 ,n an. 
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Oiigonuclcoudes derived from The irarLscripiion initiation site, e.g., between about pi^sitjons -10 
and + 10 from the start site, may also be employed to inhibit gene expression. Sinularly, inhibiuon can 
be achieved using triple helix base-pairing niethcxlology. Triple helix pairing is uset\il because it causes 
inhibition of the ability of the double helix to open sufficiently lor the binding of polymerases, 
5 transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g.. Gee, J.E. et al. (1994) in Huber, B.E. and B.i. Carr, 
Molecular and Immunologic Approaches , Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation oi mRNA by 
preventing the transcript from binding to ribosomes. 

10 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytjc cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding GCREC. 

15 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, including the following sequences: QUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 

20 candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chenucal synthesis. 

25 Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding GCREC, Such DNA sequences may be incorporated into a wide variety of vectors with 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, conslitutively or inducibly, can be introduced into cell lines, cells, or 
tissues. 

30 RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, but are not limited to, the addition of fianking sequences at the 5' and/or 3' ends 
of the molecule, or the use of phosphorothioaie or T O-mcthyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PN As and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, qucosinc, and 
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ct>mbinaional library of oligonucleoudes (such as dcoxyribonucieoUdes, nb<.)nuclcoades, pepude 
nucleic acids, and modified oligonucleoudes) lor anuscnse activity against a sf>ecitic polynuclcoude 
sequence (Bruice, T.W el al. ( 1997) U.S. Patcni No. 5,6K6.242; Bruice, T.W. et aJ. {2(X)0) U.S. 
Patent No. 6,022,691). 

5 Many methods tor imrcxlucing vectors into cells or tissues arc available and equally suitable lor 

use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated lor autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 

10 Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 

15 generally comprises an active ingredient formulated with a pharmaceutically acceptable excipient. 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations arc commonly known and are thoroughly discussed in the latest edition of Rer.-ington's 
Pharmaceutical Sciences (Maack Publishing, Easion PA). Such compositions may coasist of GCREC, 
antibcxlies to GCREC, and mimetics, agonists, antagonists, or inhibitors of GCREC. 

20 The compositions utilized in tfiis invention may be administered by any number of routes 

including, but not limited to, oral, intravenous, intramuscular, intra- arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositions for pulmonary adminisu-ation may be prepared in liquid or dry powder form. 

25 These compositions are generally aerosolized inrunediately prior to inhalation by the patient. In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-actjng 
formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and proteins), 
recent developments in the field of pulmonary delivery via Uie alveolar region of the lung have enabled 
the practical delivery of drugs such as insulin to bkxxJ circulation (see, e.g., Patton, J.S. et al., U.S. 

30 Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 
injection, and obviates the need for potentially toxic penetration enhancers. 

Comp<:>sitions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an elfective amount to achieve the intended purpose. The determination of 
an effective dose is well witfun the capability of those skilled in the art. 
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Those skjiled in the art will employ dittereni torniulauons tor nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polvpepudes will be specific to particular cells. 
condjU(ms, locations, etc. 
DIAGiNOSTICS 

5 In another embodiment, antibodies which specifically bind GCREC may be used for the 

diagnosis of disorders characterized by expression of GCREC, or in assays to monitor patients being 
treated with GCREC or agonists, antagonists, or inhibitors oJ GCREC. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above tor therapeutics. 
Diagnostic assays for GCREC include methcxls which utilize the antibody and a label to detect GCREC 
10 in human body fluids or in extracts of cells or tissues. The antibodies may be used with or without 
mcxlificaiion, and may be labeled by covalent or non-covalent attachment of a reponer molecule. A 
wide variety of reporter molecules, several of which are described above, are known in the art and may 
be used. 

A variety of protocols for measuring GCREC, including ELISAs, RIAs, and FACS, are known 

15 in the art and provide a basis for diagnosing altered or abnormal levels of GCREC expression. Normal 
or standard values for GCREC expression are established by combining body fluids or cell exu-acts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to GCREC 
under conditions suitable for complex formation. The amount of standard complex formation may be 
quantilated by various methcxls, such as photometric means. Quantities of GCREC expressed in 

20 subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding GCREC may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonuclecttide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 

25 quantify gene expression in biopsied tissues in wfuch expression of GCREC may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression ot 
GCREC, and to monitor regulation of GCREC levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of delecting polynucleotide 
sequences, including genomic sequences, encoding GCREC or closely related molecules may be used to 

30 identify nucleic acid sequences which encode GCREC. The specilicity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplificauon will determine whether the 
probe identifles only naturally occurring sequences encoding GCREC, allelic variants, or related 
sequences. 
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mood, anxjcty, and schizophrenic disorders, seasonal altective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, Touretle's disorder, progressive supranuclear palsy, conicobasal degeneration, and familial 
Irontotcmporal dementia; a cardiovascular disorder such as arteriovenous fistula, atherosclerosis, 
5 hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins. 

thrombophlebitis and phleboihrombosis, vascular tumors, complications ot thrombolysis, balloon 
angioplasty, vascular replacement, and coronary anery bypass gralt surgery, congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial inlarction, hypertensive heart disease, 
degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid aortic valve, 

10 mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbactcnal thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac u-ansplaniaUon; a gastrointestinal 
disorder such as dysphagia, peptic csophagitis, esophageal spasm, esophageal stricture, esophageal 

15 carcinoma, dyspepsia, indigestion, gastritis, gastric carcinoma, anorexia, nausea, emesis, 
gastroparesis, antral or pyloric edema, abdominal angina, pyrosis, gastroenteritis, intestinal 
obstruction, infections of the intesunal tract, peptic ulcer, cholelithiasis, cholecystitis, cholestasis, 
pancreatitis, pancreatic carcinoma, biliary tract disease, hepatitis, hyperbilirubinemia, cirrhosis, 
passive congestion of the liver, hepatoma, infectious coUtis, ulcerative colitis, ulcerative proctitis, 

20 Crohn's disease, Whipple*s disease, Mallory-Weiss syndrome, colomc carcinoma, colonic 

obsu-uciion, irritable bowel syndrome, short bowel syndrome, diarrhea, constipation, gastrointestinal 
hemorrhage, acquired immunodeficiency syndrome (AIDS) enteropailiy, jaundice, hepatic 
encephalopathy, hepatorenal syndrome, hepatic steatosis, henKx:hromatosis, Wilson's disease, alphai- 
anti trypsin deficiency, Reye's syndrome, primary sclerosing cholangitis, liver infarction, ponal vein 

25 obstruction and thrombosis, cenu^ilobular necrosis, peliosis hepaiis, hepatic vein thrombosis, veno- 
occlusive disease, preeclampsia, eclampsia, acute fatty liver of pregnancy, intrahepatic cholestasis of 
pregnancy, and hepatic tumors including nodular hyperplasias, adenomas, and carcinomas: an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 

30 asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 

polycndocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholccysUtis, contact 
dennaiiiis, Crohn's disease, atopic dermatitis, dcrmati^myositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's 

35 thyroiditis, hypereosinophilia, imtable bowel syndrome, multiple sclerosis, myasthenia gravis, 
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patient begins to approximate that which is observed in the normal subject. The results obtained troni 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
5 overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means tor detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of tliis type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or lurther 
progression of the cancer. 

10 Additional diagnostic uses for oligonucleotides designed from the sequences encoding GCREC 

may involve the use of PGR. These oligomers may be chemically synthesized, generated enzymaticaliy, 
or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding 
GCREC, or a fragment of a polynucleotide complementary to the polynucleotide enccxling GCREC, and 
will be employed under optimized conditions for identification of a specific gene or condition. 

15 Oligomers may also be employed under less stringent conditions for detection or quantification of 
closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
cnctxiing GCREC may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 
substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 

20 in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding GCREC are used to amplify DNA using the 
polymerase chain reaction (PGR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary 

25 and tertiary structures of PGR products in single-stranded form, and these differences are detectable 
using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers arc 
tluorescently labeled, which allows detection of the amplimcrs in high-ihroughput equipment such as 
DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 

30 overlapping DNA fragments which assemble into a common conseasus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical m<xlels and automated analyses of DNA sequence chromatograms. In the 
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high 
throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 
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case of a ussue or biopsy sample, or in vitro , as in the case ot a cell line. 

Transcript images which profile the expression ot the polynucleoudes ot the present invention 
may also be used in conjunction with in vitro model systems and preclmical evaluation ot 
pharmaceuticals, as well as toxicological testing ot industrial and naturally-cxjcurring environmental 
5 compounds. All compounds induce characteristic gene expression patterns, frequently termed 

molecular fingerprints or toxicant signatures, which arc indicative of mechanisms of action and toxicity 
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steincr, S. and N.L. Anderson (2000) 
Toxicol- Lett. 112-11 3:467-47 1 , expressly incorporated by reference herein). If a test compound has a 
signature siimlar to that of a compound with known toxicity, it is likely to share those toxic properties. 

10 These fingerprints or signatures are most useful and refined when they contain expression inlormation 
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression 
provides tfie highest quality signature. Even genes whose expression is not altered by any tested 
compounds are important as well, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normahzation prcx;edure is useful for comparison of expression data 

1 5 after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
example. Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2(K>0, available at http://www.niehs.nih.gov/(xVnews/toxchip.htm.) Therefore, it is 

20 important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 

25 present invention, so thai transcript levels corresponding to the polynucleotides of the present 

invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an unu-eated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compt^und in the treated sample. 

Anotf>er particular emb<Kliment relates to the use of the polypeptide sequences of the present 

30 invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 

pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 
can be subjected individually to further analysis. Proteome expression patterns, or profiles, arc 
analyzed by quantifying the number of expressed proteins and their relauvc abundance under given 
conditions and at a given time. A profile of a celfs proiame may thus be generated by separating and 
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r,r:teo..e.^^^ 

. ar-aiy^ed i. paraiiei with toxicant s„„a.„r« at the .a,.cr.p. ,eva ^e a^ ^ 

„„eentra.cr,pt andproteir, abundances ior son»p.o.=,ns .n ^^J^ ^^ay ^ use.i .n the 

•o 1 Q-^^^ <;^7> so proteome toxicant signatures may 
Se„nan« (,997) E,ec,rophores,s '^'^ ,„er the 

analysts o. compounds which do not signTicamlyallect the " ^ 

,_.cpro„,e.,nad.do,,,t.anaiysiso,tra.sc.pt.n^^^^^^^^^^^^ 
„.„„,^....prc.»nncp~^^ 
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degradation of mRNA. so proteontic pro„l,ns may ^ 
,,,.„n„cn_..,hcto.ic,tyoU..t.— sa^^^^ 

«n,pieconta.n,nsprote,ns with the test compound, Pr.«e,nsth P 

„„p,c arc separated so that the amount of each protein can he ^ ^,,,^,„„ 

,„„parcdtot,«a.n.nt„f,hcc<.esp.n.n.pr.Heininanu„«^^^^^ 
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individual proteins and comparing these partial sequences to the polypeptides of the present invenuon. 

in another enibcxliment, the toxicity ot a lest compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins Irom the biological sample are incubated 
with antibodies specific to the polypeptides ot the present invention. The amount of protein recognized 

5 by the antibodies is quantified. The amount ol protein in the treated biological sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brcnnan. T.M. et al. (1995) U.S. Patent No. 5.474,796; Schena, M. et al. (1996) Fr(x:. Natl. Acad. Sci. 

10 USA 93:10614-10619; Baldeschweiler et aK (1995) PCT applicauon W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Microarravs: A Practical Approach , M. Schena, ed. (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

15 In another embodiment of the invention, nucleic acid sequences enccxling GCREC may be used 

to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
ccxling or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybndization during chromosomal 

20 mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 

chromosome, or to artificial chromosome coastructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 

25 7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop geneUc 
linkage maps, for example, which correlate the inheritance of a disease slate with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, for 
example, Lander, E.S. and D. Botsiein ( 1986) Proc. Natl. Acad, Sci. USA 83:7353-7357.) 

Ruorcscent in situ hybridization (FISH) may be correlated with other physical and genetic map 

30 data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of geneUc map 
data can be found in various scicnufic journals or at the Online Mendelian Inheritance in Man <OMIM) 
World Wide Web site. Correlation between the location of the gene encoding GCREC on a physical 
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus may further positional clomng effoas. 
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,„ ,„a .yb,id.za.o„ >,f chro-nosom,. preparauoa, am, phys.ca. -pp.ns .«hni^ue,. sue. a. 

a"^. u.,n, ..a.„.^ *„„«„™, n,a,.e., „.a, « „.«i e.c„«, ^^^^ 
0.cn placemen, o, a s=ne on cn,.™»n,e o, .n«« n,an^a„an .pec,c». such a. n,o..e. n,a, 
„vea, a,s..ia,ed „,a,.e. even „ me exac, o«na, ,...s . noc ^own. ™s ,n.o,n,a«,n ,s 
„,„a.e,o,n.esU.a,o..ea..n.,«..ea..nes„...po.aona,c,„n,„.„„^^ 

once ,.e .e™ « .e„e» .es.,ns,* r„ a .sease s,n*o„,e '"^^^ 
eeneuc ,n*a,e u, a par„»,a, ,e„o„.c re.on, e... a,a.a..e,an„.us,a .c , J 
Ipp.„g ,o U,a, a,ea ,nay .cpresen, a.sc.,a,e. o, ,eg.,a>o,y .en. re ^.rcher ,nves.,gauo. (See. e ... 
: L, e. a,. Na,„,e 33.3,,.3«0., n»c,e..ae se<.ence .,r*e ,ns,an, ,nve„„o„ „,a, 

, a,s„ ^ nse. .o «ec, a.fe,=nccs ,„ .^aUon *e » -a„.ocaU„n. invers.on, e,e., 

a™,„g »r™,, ca.,er, „ ,n=e,e. ,nd,v,dua.. 

In another embodimem 01 Uic invention, GCREC, us calalyuc. 

..opepU^ mereo, can use. .« seteenln, ,i.ra.e. or coinpou^. ,n any or a va„e,y o, o™ 
.JeeL. technipues. The rra.n«n, e.p,oyeO In sncn scree^g may be free in so,„uon, affixed c 
. ::::po.,Jneonace.sa.ace..,«al..n,,ace„nU.y.T..-.a,,o„-,n.^^ 

between GCREC and ihe agent being tested may be measured. 

^noinet ,.^.u= r« o™, sc.ee.n, ptov^Oe. ,0, C...s.,pu. — 
.avin, suitable bl„*s ami-ty to t« prolein of interest, (See, e.^,, Geysen, et a,, (...4) 
appuLon WO«4/03564,> ,n this .emod, latge nuinb^s o, dltrcen, sm.,, test compt^nd, a, 
,„ L.^ on a solid subsltale. ^ les, compounds a,e reacted »Ub OC.BC, o, — ,b.eo 
»d wasted, Bou^l GCREC is tben detected by ntetb^. we,, ^w„ in the an, Pun.ied GCREC 
a,s<, be coaled direary onto plates ,0, use in the atorenenuoned drug screening techniques. 

::llve,y, non.— ,ng anUbt.. can be used to capture the peptide and im.obi,i.e i. on a 

T:c.e. enthCh^nt. one h.y use co.^Uve drug serening assays ,n 

an.bod.es capable or binding GCREC specilically compete wi^ a ,esl compt^nd lor b.^lmg GCREC. 
:„T:nn:,a„d..esca„.usedtode,ec,thepresence„ranyP.p.idew.chshar.„neori,i™e 

antigenic deternilnaras with GCREC. rCBFC may be used in any 

,„ addluona, embodiments, the nucleotide sequences which enc.* GCREC may 
„ molecular biology techniques that have ye. ,o . developed, provided the new '^^^^ 
properties o, nucleoUde s^ences that are currently ^.iwn. inelu^ng. hu, not „n.,a, to. 
properties as the triplet gcnedc ctxle and specific base pair interactions. 

Without further eiaKiralion, it is believed that one sKilled in the an can, using the preceding 
deschpuon. utilize me present invenhon to its lullest extent. -Hie foliowing embodiments are. 
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theretbrc, to be construed as merely illustrative, and not limitative ot the remainder of the disclosure 
in any way whatsoever. 

The disclosures of all patents, applications and puhlications, mentioned atx)vc and below, m 
particular U.S. Ser. No. 60/180,093 and U.S. Ser. No. 60/182,045, are expressly incorporated by 
5 reference herein. 

EXAMPLES 

I. Construction of cDNA Libraries 

lnc>tc cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD database 

10 (Incyte Genomics. Palo Alto CA) and shown in Table 4, column 5. Some tissues were homogenized 
and lysed in guanidinium isothiocyanate, while others were homogemzed and lysed in phenol or in a 
suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic soluUon of phenol 
and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted 
with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 

1 5 eihanoU or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX niRNA purificaUon kit (QIAGEN). Alternauvely, RNA was 

20 isolated directly from tissue lysates using other RNA isolauon kits, e.g., tfie POLY(A)PURE mRNA 
purification kit (Ambion, AusUn TX). 

In some cases, Siratagene was provided with RNA and constructed the corresponding cDN A 
libraries. Otherwise, cDNA was synthesized and cDN A libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 

25 recommended pr(x;edures or similar methcxls known in the art. (See, e.g., Ausubel, 1997. supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restricUtin enzyme or en/.ymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL SKKK), SEPH AROSE CL2B, or SEPH AROSE CL4B column 

30 chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORTl plasmid (Life Technologies), PCDNA2.1 plasmid 
(Invitrocen, Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (Incyte Genomics, Palo Alto 
CA), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells 
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^ nH^fi DHIOB, or EleciroMAX 
,„c,ud,„s XL,-B,ue, XL,-B..eMRf , o, SOLR To. S.augencor DH5„. 

DHIUB irom Lite Technologies. 

U. Isolation of cDNA Clones ^ ^^^^^^^^^ 

Plasnuds obtained as described in Example 1 were recovered from host cel. by 

, rs.raia.ene) or by cell lysis. Plasmids were punt.ed using at least 
using the UNIZAP vector system (Sirau^ene) or Dy (Promcca)' an AGTC 

^rWI7ARDMiniprepsDNApunticauon system (Promcga>, 
one ol the following: a Mag.c or WIZARD M p P ^^^^^^ ^^^^^^ 

M..prep punncaUon (Edge Biosys.ems, Gaithersburg MD), and Q A 

O.AWE...UltraPlasm.dpur.r.caUonsy^^^^^^^^ 
purUlcaUon Kit Irom QIAGEN. Following precipitauon. plasm^ds were resusp 
0 disUlled water and stored, w.th or without lyophili.ation. at 4 C ^ 

Alternauvely plasnud DN A was amplified trom host cell lysates using di 
Alternauvely, p ^^^^^^^ ^^^^ 

.„hput format^aa ^-^^^ ^^^^.^ _ ,_sed and stored in 3S.- 

cycling steps were carried out ,n s n.le n„tr.ca.ly using 

well plates, and the concentration of amphfied plasn.d 
15 PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN 
(Labsystems Oy, Helsinki, Finland). 

1,1. Sequencing and Analysis sequenced as follows. 

„,C.O..B.0O.— ^^^^^^ 

using reagents provided by Amersnam rn Biosystems). 

----- rrx::==--->-^^^^ 

25 ElectrophoreUc separation ot cDNA scquencmg Dynamics); the ABI 

^ vipr AR ACE 100() DN A sequencing system (Molecular Dynamics;, 

irames wuh.n the cDN A sequc ^^^^^ ^^^^^^^ 

suEia. unit 7,7). S(.me of the cDN A sequences were selected 



.•?o 

disclosed in Example Vlll. ^ wvte cDN As were validated by removing vector. 

The polynucleotide sequences derived irom Inc^c ^^^^ 

hv maskinc ambiauous bases, using algorithms anu p _ 
linker, and poly( A) sequences and by masking . ^^^^ 
on BLAST, dynantic programn.ng, and dinucleoUde nearest neighbor analysis. 
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sequences or translatjons thercot were ihen queried against a selection o( public databases such as the 
GenBank primate, rtxlent, niammahan, vertebrate, and eukaryotc databases, and BLOCKS, PRINTS. 
DOMO, PRODOM, and hidden Markov nnxlel {HMM)-based protein Taniily databases such as PFAM. 
(HMM IS a probabihsuc approach which analyzes consensus primary structures ot gene lamilies. 

5 See. for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) The queries were 

pertormed using programs based on BLAST, FAST A, BLIMPS, and HMMER. The Incyie cDNA 
sequences were assembled to produce lull length pi^^lynucleoude sequences. Alternatively, GenBank 
cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or Genscan-predicted coding 
sequences (see Examples IV and V) were used to extend Incyte cDNA assemblages to hill length. 

10 Assembly was pcrlormed using prograias based on Phred, Phrap, and Conscd, and cDNA assemblages 
were screened for open reading Irames using programs based on GeneMark, BLAST, and PASTA. 
The hill length polynucleotide sequences were u*anslated to derive the corresponding full length 
polypeptide sequences. Alternatively, a polypeptide of the invention may begin at any of the methiomne 
residues of the full length translated polypepude. Full length polypepude sequences were subsequently 

15 analyzed by querying against databases such as the GenBank protein databases (genpept), SwissProt, 
BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and hidden Markov model (HMM)-based protein 
family databases such as PFAM. Full length polynucleotide sequences are also analyzed using 
MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and 
LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments are 

20 generated using default parameters specified by the CLUSTAL algorithm as incorporated into the 
MEG ALIGN mulUsequence alignment program (DNASTAR), which also calculates tfie percent 
identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyie cDNA and full length sequences and provides applicable descriptions, references, and threshold 

25 parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions thereof, the third column presents appropriate references, all of 
which are incorporated by reference herein in their entirety, and the fourth column presents, where 
applicable, the scores, probability values, and other parameters used to evaluate the strength of a match 
between two sequences (the higher the score or the lower the probability value, the greater the identity 

30 between two sequences). 

The programs described above for the assembly and analysis of full length f>olynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:22-42. Fragments from about 20 to about 4(KX) nucleotides which are useful in hybridization and 
amplification technologies are described in Table 4, column 4. 
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.denUf.ca««n and Editing of Coding Sequences from Genomic DN A 

organisms (See Burge, ana ^. ^r^HuMeti exons lo torm an 

C„„, op.. S..«. B,o,, S,3«.35.,. T.e p.... »— P-<^- J ^ 
_,.,.a .ONA sequence »,e„.n. .on, a ^..o.. u. a ^J^^^^^ 

. c^t tr^ ^t) kb To determine which ol thei>e oenscan 

genpepl and gbpn public databases. Where necessary, ^^.^^^ 
.V, .,,r,RT AST hit ttomeenpept to correct errors m the sequent 1 

Oeascan, such as ex.ra o, o^ued exons. BLAST ana,^ 

----- --^':rrr;ra— ,o;o.,ec...^^^^^ 

Censcan pre.«c,ad se<,.«.e. .=ng^ Po V ^^^^ ^^^^ ^^^^ „^,„^ 

U« assembly process described m Example III. Allernau 

<«wcd en..c,y from cdired « une^red Oe.can.pre..c.ed codln, s»^e^.^ 

V. Assembly of Cenonrlc Se,ue„« D.U »IU. cDNA .e,u.nce Da.. 

"Stitched" Sequences . Genscan gene ideniification 

Par,.aUDNAse.ue„ceswercex,endedw,lbexonspred,cedby^G^ca 

- p.o.a.d.cr,bedl„Exa.plelV,Parr,alcO.Asa— ^.^^^^^^^^^^ 
,„,e„on.cO.Aandp.rscd,n,oc.s— 

- - ^""""^ ^":::,re CDMA .nd .^.c ..o,n.Uon, .enerabn. ..slble spl,ce 
a,Hldynan«cprogrann,»ng,o,mesra.ecDN - ,„ „eate a (nil lenab sequence. 

vananis .a, were subse.„en„y co..^ edl.ed. or -"^ " ^ ^^^^ ,„ 

Se.uc«c.„.ervals,„.«b,..e„breleng,bo,,nc,n,ervalw.spr.c™ n 

»ere ,den,.n.. and Inler.als ,bus ,den„„.. were ^ ^ ^„ 

..cxa.ple.,,an..er.lw.pre.n.— 

were considered to be equivalent. This process 
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be brought together, bridged by cDNA sequence. Intervals tbus idenufied were then ' stitched" together 
by the stitching algorithm in the order that they appear along their parent sequences to generate the 
longe^st possible sequence, as well as sequence variants. Linkages between intervals which proceed 
along one t>pe ol parent sequence (cDN A to cDNA or genomic sequence to genomic sequence) were 

5 given preference over linkages which change parent type (cDNA to genomic sequence). The resultant 
stitched sequences were translated and compared by BLAST analysis to the genpepl and gbpri public 
databases. Incorrect exoas predicted by Genscan were corrected by comparison to the top BLAST hit 
from genpept. Sequences were further extended with additional cDNA sequences, or by inspection ol 
genomic DNA, when necessary. 

10 ^'Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example III were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases using 
the BLAST program. The nearest GenBank protein homolog was then compared by BLAST analysis 

15 to either Incyle cDNA sequences or GenScan exon predicted sequences descrit>ed in Example IV. A 
chimeric protein was generated by using the resultant high-scoring segment pairs (HSPs) to map the 
translated sequences onto the GenBank protein homolog. Insertions or deletions may occur in the 
chimeric protein with respect to the original GenBank protein homolog. The GenBank protein homolog, 
the chimeric protein, or both were used as probes to search for homologous genomic sequences from the 

20 public human genome databases. Partial DNA sequences were therefore "stretched" or extended by the 
addition of homologous genomic sequences. The resultant stretched sequences were examined to 
determine whether it contained a complete gene. 

VI. Chromosomal Mapping of GCREC Encoding PolynucleoUdes 

The sequences which were used to assemble SEQ ID NO:22-42 were compared with 
25 sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:22-42 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
^0 Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map locauon. 

Map locations are represented by ranges, or intervals, or human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome" s p- 
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M „ ,.M> « . »ni. of measuremcm h.sed „„ ,«o,.b,nauon frequences «™ee„ 

J,s,a™es are based on geneuc nrarkers mapped by ^^^^ 
„,br,d n,ar.er. wbose .e.„e„ces .ere inCnded n 

„.er re».ree, a,a„ab,e ,o d,e ^ "^^^^ „ ,denu„ed 

(tmp //www ,.cbi.r,lm.iuh-gov/seaemap/). can be employ 

l,le .ene. „.ap w...n „, .n pro«.nrUy u, .nfervafs ,nd.a,ed above^ 
VI.. Analysis «f P„lynucle«Ude Expression ^^^^^^ ^ 

..bernana.ys..sa_^^^^^^^ 

::;:::r::r - ... sa.br». ^. . ^u... o.. 

„,„,„„ BLAST were used ro sear* for laenuealorrelaied 
~» eompnrcr ■«l».^- ap lyrng BL^ ^^^^^^^^ 

„.ec.es rneONA— sncb « ^ ^^^^^^ ,,,e.ldvUy o, *eco.p»-er 

rasr. .ban muldple n«n,bra,«.based "V^— ^ „ 

search can be modified 10 derermine wbed« aay paiucuUr malch 
basis Of U-e searcb is .be product score, wbicb is defined as: 



P\ A<r Slrore X P^rt-nt Identity, 



' 5 X minimum {length(Seq. D, lcnglh(Seq. 2)} 

. ,h ,hc dceree ol similarity between two sequences and the length 
The prcKluct score taKes mto account both tbe degree oi J 

,.>, Thf nroduct score is a normalized value oeiwecii 

./::,r::r::,smu.dp...by..cen,b..^^^^^^^^^ 

n.sb.e— 

,ban one HSP, ..«n pa,r wifb bi.«s, BLAST core - 

.n,. u balance between fracuonal overlap and quality m 
,„ prr^uc, score rcpresen.s ' „„„ enure le„.,h of ^ sbor.er 

example, a pr.xiucf score o, 100 ^ ^ ,^ „ ,«« idendfy and 

„r ,be fwo sequences bc,.,g compared. A produc. sec „, 5,, ,s 

, .,h»«R%ideraity and 100% overlap allbeolhei. A proou 
70* overlap al one end. or by 88% .aenuty 

prcxfucr^erlberby idenUly and 50% overlap a, one end, or 79% ident . 
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Alternatively, px^lynucleoude sequences encoding GCREC are analyzed with respect to tiie 
tissue sources from which ihey were derived. For example, some full length sequences are assembled, 
at least in pan, with overlapping Incyte cDNA sequences (sec Example III). Each cDNA sequence is 
derived from a cDNA library consuucied from a human ussue. Each human tissue is classified into one 
5 of the lollowing organ/tissue categories: cardiovascular system; connective tissue; digestive system; 
embryonic structures; endocrine system; exocrine glands; gerutalia, female; genitalia, male: germ cells; 
henuc and immune system; Uver; musculoskeletal system; nervous system; pancreas; respiratory 
system; sense organs; skin; stomalognathic system; unclassified/mixed; or urinary tract. The number of 
libraries in each category is counted and divided by the total number of libraries across all categories. 

to Similarly, each human tissue is classilied into one of the following disease/condition categories: cancer, 
cell line, developmcntaU inflammation, neurological, trauma, cardiovascular, p(X)led, and other, and the 
number of libraries in each category is counted and divided by the total number of libraries across all 
categories. The resulting percentages rellect the tissue- and disease-specific expression of cDNA 
encoding GCREC. cDNA sequences and cDNA library/tissue information are found in the LIFESEQ 

15 GOLD database (Incyte Genomics, Palo Alto CA). 

VIIL Extension of GCREC Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
fragment of the full length molecule using oligonucleotide primers designed from this fragment. One 
primer was synthesized to initiate 5' extension of the known fragment, and the other primer was 

20 synthesized to initiate 3' extension of the known fragment. The iniUal primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target sequence 
at temperatures of about 68 '^C to about 72 "C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

25 Selected human cDNA libraries were used to extend the sequence. If more than one extension 

was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PGR 
was performed in 96-well plates using the PTC-2(X) thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg'*, (NH4)2S04, 

30 and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enyymc 
(Life Technologies), and Pfu DNA polymerase (Stratagcne), with the lollowing parameters tor primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94"C, 15 sec; Step 3: 6()"C, 1 min: Step 4: 68°C. 
2 mm; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68''C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94°C, 3 min; Step 2: 
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specifically described, essenually the same prcxedure is used wiUi larger nuclcoude tragmenis. 
Olieonucleoudes are designed using siaie-ot-lhe-an software such as OLIGO 4.06 sollware (Nauonal 
Biosciences) and labeled by combining 50 pmoi ot each oligomer, 250 /^Ci of [y-'^P] adenosine 
triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN\ Boston 
5 MA). The labeled oligonucleotides are subsianually purified using a SEPHADEX G-25 superfine size 
exclusion dexlran bead column (Amersham Pharmacia Biotech). An aliquot containing 10^ counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase 1, Bgl II, Eco RI, Pst I, Xba I, or 
PvTi H (DuPont NEN). 

10 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus, Schleicher & Schucll Durham NH). Hybridization is carried out for 16 
hours at 4()°C. To remove nonspecific signals, blots are sequenually washed at room temperature 
under conditions of up to, for example, 0. 1 x saUne sodium citrate and 0.5% sodium d<xlecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 

1 5 compared. 

X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric prinUng (ink-jet printing. See, e.g., Baldeschweiler, supra.), mechanical 
niicrospotting technologies, and derivatives thereof The substrate in each of the aforementioned 

20 technologies should be uniform and solid with a non-porous surface (Schena (1999), supra). Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. AlternaUvely, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
produced using available methods and machines well known to those of ordinary skill in the art and may 

25 contain any appropriate number of elements. (See, e.g., Schena, M, et al. (1995) Science 270:467-470; 
Shalon, D. el al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotcchnol. 
16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
30 selected usmg software well known in the art such as LASERGENE software (DNASTAR). The array 
Clements arc hybridized with polynucleiuidcs in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleoudes from the biological sample are removed, and a 
nuorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
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concentrauon of UK) ng/^J, is loaded into the open capillary prinung element by a high-speed robouc 
apparatus. The apparatus then deposits about 5 nl ot array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagcne ). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distjilcd water. 
5 Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buttered saline (PBS) (Tropix, Inc., Bedford MA) for 30 nunutes at 60''C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 |i3 of sample mixture consisting of 0.2 Mg each of Cy3 and 
10 Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridizaUon buffer. The sample 

mixture is heated to 65'' C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1 .8 cnr coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addiaon of 
140 fil of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
15 about 6.5 hours at 60''C, The arrays are washed for 10 min at 45''C in a first wash buffer (IX SSC, 
0. 1 % SDS), three times for 10 minutes each at 45''C in a second wash buffer (O.IX SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral Unes 

20 at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5- The excitaUon laser light is 
focused on the array using a 20X micro.scope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

25 In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hainamatsu Photomcs Systems, Bridgewaier NJ) correspwnding to the two tluorophores. Appropnate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used arc 565 nm for Cy3 and 650 nm for Cy5. Each array is 

30 typically scanned twice, one scan per fiuorophore using the appropnate filters at the laser source, 
although the apparatus is capable of recording the spectra from bi^^th Huorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentrauon. A specific locaUt^n on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 

35 hxation to be correlated with a weight raUo of hybridizing species of 1 :100,0(X)- When two samples 
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(AcMNPV), conimonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding GCREC by either homologous recombmation or bactenal-mediaied 
transposition involving transfer plasmid intermediaies. Viral inlectjviiy is maintained and the strong 
polyhedrin promoter drives high levels of cDNA iranscnptjon. Recombinant baculovirus is used to 
5 infect Spodoptera frueiperda (St9) insect cells in most cases, or human hepatocytes, in some cases. 
Inlcction of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. ci 
al. (1994) Proc. Natl. Acad. Sci. USA 91 :3224-3227: Sandig, V. el ah (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, GCREC is synthesized as a fusion protein with, e.g., glutathione 

10 S-traasferase (GST) or a peptide epitope tag, such as FLAG or 6-His, pernutting rapid, smglc-step, 

aftinity-bascd purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma iaponicum , enables the purification of fusion proteins on immobilized 
glutatfuone under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from GCREC at 

15 sf>ecitlcally engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 16). Purified GCREC obtained by these methcxis can be used directly in the assays shown in 

20 Examples XVI, XVII, and XVllI, where applicable. 
XIII. Functional Assays 

GCREC function is assessed by expressing the sequences encoding GCREC at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 

25 include PCMV SPORT (Life Technologies) and PCR3. 1 (Fnvitrogen, Carlsbad C A), both of which 

contain the cytomegalovirus promoter. 5-10 fug of recombinant vector are transiently transfccted into a 
human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. \-2 fug of an additional plasmid containing sequences encoding a 
marker protein are co-iransfcctcd. Expression of a marker protein provides a means to distinguish 

30 transfectcd cells from nontransfccted cells and is a reliable predictor of cDN A expression Ifom the 
recombinant vector. Marker proteins of choice include, e.g.. Green Ruorcscent Protein (GFP: 
Clontcch), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify iransfectcd cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake or 
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chromatography using anulxxlics specific lor GCREC. An iminunoaftlnity column is constructed by 
covalcntly coupling anli-GCREC aniibody lo an aciivated chromatographic resm, such as 
CXBr-dCUvaied SEPHAROSE (.Ajnersham Pharmacia Biotech). Alter the coupling, the resin is 
hkKked and washed according to the manutacturer's instructions. 
5 Media containing GCREC are passed over the inuiiunoaftlnity column, and the column is 

washed under conditioas that allow the preferential ahsorbance oi GCREC (e.g., high ionic strength 
buffers in the presence of detergent). The column is elutcd under conditjoas that disrupt 
antibody/GCREC binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or tfinx;yanaie ion), and GCREC is collected. 

10 \VI. Identification of MolecuJes Which Interact with GCREC 

Molecules which interact with GCREC may include agonists and antagonists, as well as 
molecules involved in signal traasduction, such as G proteins. GCREC, or a fragment thereof, is 
labeled with ^^'l Bolion-Hunter reagent. (See, e.g., Bolton A.E. and W.M. Hunter (1973) Bio:hem. J. 
1 33:529-539.) A fragment of GCREC includes, for example, a fragment comprising one or more of the 

15 three extracellular l(v>ps, the extracellular N-terminal region, or the tiiird intracellular Ux)p. Candidate 
molecules previously arrayed in the wells of a multi-well plate are incubated with the labeled GCREC, 
washed, and any wells with labeled GCREC complex are assayed Data obtained using different 
concentrations of GCREC are used to calculate values for the number, affinity, and associauon of 
GCREC with the candidate ligand molecules. 

20 AltemaUvely, molecules interacUng with GCREC are analyzed using the yeast two-hybrid 

system as described in Fields, S. and O. Song (1989) Nature 340:245-246, or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clonicch). 
GCREC may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) which 
employs the yeast two-hybrid system in a high-throughput manner to determine all interacuons l>eiween 

25 the proteins encoded by two large libraries of genes (Nandabalan, K. ei al. (2(MK)) U.S. Patent No. 
6,057,101). 

Potential GCREC agonists or antagonists may be tested for activation or inhibition of GCREC 
receptor activity using the assays described in sections XVII and XVIII. Candidate molecules may be 
selected Irom known GPCR agonists or antagonists, peptide libraries, or combinatorial chemical 
30 libraries. 

Methods for detecting interacuons of GCREC with inu-acellular signal transduction molecules 
such as G proteins are based on the premise that internal segments or cytoplasmic domains from an 
orphan G protein-coupled seven uansmembrane receptor may be exchanged with the analogous 
domains of a known G protein-coupled seven u-ansmembrane receptor and used to identify the G- 
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ligand/recepior- mediated modulaiion of cell proliieration. This assay measures ihe rate ot DNA 
synthesis in Swiss mouse 3T3 cells. A plasmid containing polynucicoudes enctxling GCREC is added 
to quiescent 3T3 cultured cells using transtccuon methods well known in the art. The u-ansiently 
translccied cells are then incubated in the presence ol l^H]thyniidine, a radioactive DNA precursor 

5 molecule. Varying amounts of GCREC ligand are then added to the cultured cells. Incorporation ol 
[^H]ihyniidine into acid-prccipilable DNA is measured over an appropriate Lime interval using a 
radioisotope counter, and the amount incorporated is directly proportional to the amount ol newly 
synthesized DNA. A linear dose-response curve over at least a hundred-fold GCREC ligand 
concentration range is indicative of receptor activity. One unit of activity p>er milliliter is defined as the 

10 concentration of GCREC producing a 50% response level, where 100% represents maximal 

incorpc^ration of ('H]thymidine into acid-precipitable DNA (McKay, L and I. Leigh, eds. (1993) 
Growth Factors: A Practical Approach , Oxford University Press, New York NY, p, 73.) 

In a further alternative, the assay for GCREC activity is based upon the ability of GPCR 
family proteins to modulate G protein-activated second messenger signal transduction pathways (e.g., 

15 cAMP; Gaudin, P. et al. (1998) J. Biol. Chem. 273:4990-4996). A plasmid encoding full length 
GCREC is transfected into a mammalian cell line (e.g., Chinese hamster ovary (CHO) or human 
embryonic kidney (HEK-293) cell lines) using methods well-known in the art. Transfected cells arc 
grown in 12-well trays in culture medium for 48 hours, then the culture medium is discarded, and the 
attached cells are gently washed with PBS. The cells are then incubated in culture medium with or 

20 without ligand for 30 minutes, then the medium is removed and cells lysed by treatment with 1 M 
perchloric acid. The cAMP levels in the lysate are measured by radioimmunoassay using methods 
well-known in the art. Changes in the levels of cAMP in the lysate from cells exposed to ligand 
compared to those without ligand are proportional to the amount of GCREC present in the u-ansfected 
cells. 

25 To meaiiure changes in inositol phosphate levels, the cells are grown in 24-well plates 

containing 1x10^ cells/well and incubated with inositol-frec media and [^H|myoinositol, 2 M,Ci/well, for 
48 hr. The culture medium is removed, and the cells washed with buffer containing 10 niM LiCl 
followed by addition of ligand. The reaction is slopped by addition of perchloric acid. Inositol 
phosphates are extracted and separated on Dowex AG1-X8 (Bio-Rad) anion exchange resin, and the 

30 total labeled inositol phosphates counted by liquid scintillation. Changes in the levels of labeled 

inositol phosphate from cells exposed to ligand compared to those without ligand are proportional to 
the amount of GCREC present in the transfected cells. 
XVI II. IdentificaUon of GCREC Ligands 

GCREC is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK 
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What IS clajmed is: 

1. An isolated polypcpude comprising an amino acid sequence selected Irom the group 
const sung of: 

s a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -2 L 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-21, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisung of SEQ [D NO: 1-21, and 

10 d) an immunogenic fragment of an amino acid sequence selected from the group consisung 

of SEQ ID NO: 1-21. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO: 1- 

21. 

15 

3. An isolated polynucleotide encoding a polypeptide of claim 1 . 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An ist)lated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:22-42, 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method for producing a polypepUde of claim 1, the method comprising: 

a) cultunng a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
compnses a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1 , and 

35 b) recovering the polypeptide so expressed. 
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U). An isolated antib<,dy which spec.fically binds to a polypepude of cla.m 1 . 

n . An .solated po.ynuc.eoUdc compns.n, a po.ynuc.eot.de sequence selected fro. the 

group consisting of. . ceo in NO 22-42 

a polynucleoUde sequence selected tron. the group cons.shng ot SEQ ID NO. 2 42. 
h) a naturally occurring polynucleotide sequence having at le.st 90.. sequence identity to a 
polynucleoude sequence selected from the group consisUng of SEQ ID NO:22-42. 

c) a polynucleoUde sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 
10 e) an RNA equivalent of a)-d). 

, 2. An isolated polynucleotide comprising at lea.st 60 conUguous nucleotides of a 
polynucleotide ot claim 1 1 . 

, 3 A n,.,hod ,0, «eca.. a .arge, po.ynuCeoUde in a ..mp.e, .aid ,a,ge, po.y.uCeoOde 
.av,„g a sequence of a polynucleCde of claim 1 1 . the memod comprlsmg: 

a, „y«ldl.ln. ^ .a.ple w„» a probe ...prlsln, a, leasf 20 co„.i,uo„. ™cleoude» 
.o.p„.:,:se.e„ceco.p™,osald...,.>lyn.leoUde.„.e.^pK 
specllcally l„..d,^es fo said ,a,g« polynucleoUde. under condiUons whereby a hybrrdu u„n 
,„ o^plcK , foled ...ween sard pr„^ and ..d ,ar,el polynucleoUde or fra,.en. .hereo an 
„de,ecUn„hep,e.enceorabsenceofsa.dhybrld,zaUcncomp,e.and.opuo„al,y.,. 

present, the amount ihereot, 

,4. A nrchod Of C.m ,3, wherein ,hc probe co.pr,ses a, leas, 60 conUguous nuCeoUdes. 

,5 A ™,hod for delecnng a targe, polynucleoUde In a santplc, s^d targe. ^,lynucleoUde 
hav,n= a sequence of a polynucleoUde o, claim 1 1 . ,hc n,ed,od compr,s,ng: 

: IpUfylng si target „„nucleoUde or fragn,ent ,her»f u.n, polymerase cbatn reacuon 

,„ """":::,:::„g ,he p.e..e „ .^..^ * - 

thereof, and, optionally, if present, the amount thereof. 

,6. A composiUon comprising an effective amount of a polypeptide of Claim 1 and a 
pharmaccutically acceptable cxcipicnt. 
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17. A composition of claim 16, wherein ihe polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-21 

18. A method for treating a disease or condition ass(x;iated with decreased expression of 
funcuonal GCREC, comprising administering to a patient in need of such treaiment the composition 
of claim 16. 

19. A method tor screening a compound for effectiveness as an agomst of a polypeptide ol 
claim 1 , the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
h) detecting agonist activity in the sample, 

20. A composition comprising an agonist compound identified by a methcxi of claim 19 and 
a pharmaceulically acceptable excipient. 

21 . A method for treating a disease or condition associated with decreased expression of 
functional GCREC, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 
of claim 1, the method comprising: 

a) exposmg a sample comprising a polypeptide of claim I to a compound, and 

b) detecting antagonist activity in the sample. 

23. A composition comprising an antagonist compound identified by a method of claim 22 
and a pharmaceulically acceptable excipient. 

24. A method for treating a disease or condition associated witli ovcrcxpression of functional 
GCREC, comprising administcnng to a patient in need of such treatment a composition of claim 23. 

25. A method of screening for a compound that spccitlcally binds to the polypeptide of claim 
1, said methcxl comprising the steps of: 

a ) combining the p<:)lypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
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compound tM spetttically b,n* to .he p..lyrcpu* "f ^laim 1 

, „, f- 0 compound ma, ...lu.a.es ..e acUv,,, o, .he po.^^pu* ... 

lpa.:.a,eaa,v,„o, .epo>^a..oU.,a.n, , p,cscnce or compound 
„.J.:,;on:po,,pcp.*.---„,Hea«e„c.„a.e,e.co,„.«n.w^^^^^^^^^ 
.e po.pep.,Oe „r Cai. , ,n « p,csc„e o, * .cs, a....n. . .»..ca„vc 

compound n»dula,es ,he acUvUy .,1 .he polypep.ide of da,n, . . 

„ . n,e.h«i ,0, ..eenlns a comp-«.^ >« «T=^*e»cs. .n a,.en..g cxpr»»,n o, a .a.« 
po„„u*.de. «.n .a.d ,a.ae. ^-ynuCcOde »n.p.,se. a sequence o. da,. 5, .e.h.. 

'"'""'aT expos.n. a samp. co.p..s,n, ,he ,a..e, po.nuc,eo..de .o a com^nd. unde. cond,.,on. 
suitable lor ,he exp,ess,on of the target polynueleotide. 

h, detecung altered expression of the targe. polynueleoUde, and 

Tpanng .he expr.*n of the targe. ^,ynuc,eo„de in .e presence of varyrng amounts o 

the compound and in the absence of the compound. 

.S A memod for asse,,s,ng toxiCy of a test compound. sa,d nrethot. conaprtsing: 
treaung a biologtea. sample containing nucfeic acids w,th the ,es, compound; 

bning Jnucleic acids of ^ Ireated biological san.ple with a probe compns.ng at 
, ,cas,2r.huUesofap.,lynuc,eo.e„fcla,m,lu— 

<.n,ni .s lormcd between said probe and a target polynucleotide 

1 1 or fragment thcreol ; 

ouantifv.nc the amount ol hybridizaiion complex; and 

_„lyblat;ncomp,ex,„anuntreal.b,olog,calsample« 
„„„c„, „, nybr,d,.ation complex 1. the treated hioiogica, sample .nd.cati.e to. 

compound. 



oi 



30 



106 



wo <H 5^085 PCT/LSOl/03455 

<-llG> INCYTE GENOMICS, INC. 
3AUGHN, Mar i ah R . 
AU - YCUMG , Jan 1 c G 
YUE, Henry 

<• 120> G-PRCTEIN COUPLED RECEPTORS 



^ 13 C> 


PI- oc 


3 2 PCT 


< 14 G> 

< 14 1 > 


To Be 
Herew 


Ass igned 
ith 


<1S0> 
< 151> 


60/180.093; 60/182,045 
2000-02-02; 2000-02-11 


< 160> 
<- 17 0> 


42 

PERL 


Program 


' 2 1 0 > 
' 2 1 1 > 

• :212> 

< 213> 


1 

470 
PRT 
Homo 


sapi ens 


<^ 2 2 0 > 

<:22 1> 
<; 2 2 3 > 


m sc_ feature 

Incyte ID No: 7472033CD1 


<22C> 
<: 2 2 1 > 
<^222 > 
<223 > 


unsure 

59, 128, 309 
unknown or other 


<; 4 0 0 > 







Met 


Asn 


Gin 


Thr 


Glu 


Pro 


Ala 


Gin 


Leu 


Ala 


Asp 


Gly 


Glu 


His 


Leu 


1 








5 










10 










15 


Ser 


Gly 


Tyr 


Ala 


Ser 


Ser 


Ser 


Asn 


Ser 


Val 


Arg 


Tyr 


Leu 


Asp 


Asp 






20 










25 










30 


Arg 


His 


Pro 


Leu 


Asp 
35 


Tyr 


Leu 


Asp 


Leu 


Gly 
40 


Thr 


val 


His 


Ala 


Leu 
45 


Asn 


Thr 


Thr 


Ala 


He 
50 


Asn 


Thr 


Ser 


Asp 


Leu 
55 


Asn 


Glu 


Thr 


Xaa 


Ser 

60 


Arg 


Pro 


Leu 


Asp 


Pro 


Val 


Leu 


He 


Asp 


Arg 


Phe 


Leu 


Ser 


Asn 


Arg 






65 










70 










75 


Ala 


Val 


Asp 


Ser 


Pro 


Trp 


Tyr 


His 


Met 


Leu 


He 


Ser 


Met 


Tyr 


Gly 








80 










85 










90 


Val 


Leu 


He 


Val 


Phe 

95 


Gly 


Ala 


Leu 


Gly 


Asn 
100 


Thr 


Leu 


Gly 


Cys 


Tyr 

105 


Ser 


Pro 


Ser 


Ser 


Gly 
110 


Ser 


Pro 


Ser 


Cys 


Ala 
115 


Leu 


Leu 


Ala 


He 


Trp 

120 


Phe 


He 


Leu 


Asn 


Leu 
125 


Ala 


He 


Xaa 


Gly 


Gin 

130 


Ser 


Lys 


Cys 


Glu 


Ser 
135 


His 


Pro 


Ser 


Gly 


Leu 


Ser 


Asp 


Leu 


Leu 


Leu 


Cys 


Leu 


Val 


Thr 


Met 








140 










145 










150 


Pro 


Leu 


Thr 


Leu 


Met 
155 


Glu 


He 


Leu 


Ser 


Lys 
150 


Tyr 


Trp 


Pro 


Tyr 


Gly 
165 


Ser 


Cys 


Ser 


He 


Leu 


cys 


Lys 


Thr 


He 


Ala 


Met 


Leu 


Gin 


Ala 


Leu 








170 










175 










180 


Oys 


He 


Phe 


Val 


Ser 


Thr 


He 


Ser 


He 


Thr 


Ala 


He 


Ala 


Phe 


Asp 








185 










190 










195 


Arg 


Tyr 


Gin 


Val 


He 


Val 


Tyr 


Pro 


Thr 


Arg 


Asp 


Ser 


Leu 


Gin 


Phe 








200 








205 










210 


Val 


Gly 


Ala 


Val 


Thr 


He 


Leu 


Ala 


Gly 


He 


Trp 


Ala 


Leu 


Ala 


Leu 








215 










220 










225 


Leu 


Leu 


Ala 


Ser 


Pro 
230 


Leu 


Phe 


Val 


Tyr 


Lys 
235 


Glu 


Leu 


He 


Asn 


Thr 
240 


Asp 


Thr 


Pro 


Ala 


Leu 


Leu 


Gin 


Gin 


He 


Gly 


Leu 


Gin 


Asp 


Thr 


He 



1/31 



BNSDOCID ^WO 0157085A2 



PCT/l S(n/034 



\\0 01/57085 

245 ^ ;.sn Gly Arg Phe Tyr 

Pro cys He Glu Asp Trp Pro Ser Ar| Asn 

: l ... ^ v. v. ^ 

27 5 , X- TH,^ A^n Lvs Leu Lys Ser 

.,= 1 jxia "-/r Phe Gly I^e Tyr A:=n uy^ qq 
Tie Val ser Val Ala ^/r n -i j 

290 , ^ val Glu Arg Gly 

o^v, '^=.1 val Ala val Gin Xaa Arg Lys 
A.g He Thr val Val 3,0 

Arg .rg Me. Lys Arg Thr Asn Cys Lau Leu II. 0 
V , III Trp Leu Pro Leu Asp Phe Phe Asn Leu T-yr 
He Phe Gly Val Ser Trp Le 

335 ^ ^ Ser Met Leu Vai i^ry 

A^a ASP Met Glu Arg Ser Pro Val Thr Gin 

s„ s„ 5» 

1 .eu ... ry. Ill .SP - 0 

3 80 ^ ^ i.y,^ val Ala Leu Asn 

T^>. r^vs Ara cys ser Asp Thr A^n vd 
Gin Glu Leu Leu ^^ys Arg y 

™. T.. cv. .sn va. o.„ .Ja Ma ..a . , ^ . 

::; .v. "° 

. V 1 -V or, ill «, V- "° -.1 ^ 0 

oly His Arg iju His Asp Arg H.s Hxs Glu Gly 

' 455 
Asn He val His His 

/\SIi XX 

<210> 2 

<211> 326 

<212> PRT 

<213> Homo sapiens 

' . „n Oln A.P Gla II. s.r "It 
! I.e .V. n. O.. Ma P.O ..u .V. 

Phe Arg Arg s.r II. V 25 p„ 

XI. pro =ly ^e. Olu alu s.r Cl„ Hj; Trp 

II. I..U T^r I..U ..u Al. L.U val =ly Asn val ^ 

: iia X.. ^1 ::: i 

ser Thr Ala Pro Lys Ala Leu Ala Val Leu 

95 Met Phe Phe He His 

Olu lie Gly Tyr He Val Cys Leu He Hn .0 

val Leu val Ala Met Ala Leu 

Ma Phe ser Ser Met Glu Ser Gly Val ^ 135 
V ^ All He cys H.s Pro Leu His His Ser Thr He 

ASP Arg Tyr Va. Ala Cy ^^^^ 

.eu H.s pro Gly val He Oly Arg He .1. ,^5 
, ,eu Leu S Leu He Pro Phe Pro He Leu Leu Gly Th^ 

' Z Z cys III Ala Thr He He cl^ H.s Ala .yr Cys Glu 



Leu 

2/31 



3NSOOC1D <WC 0167065A2 



w o 01 5^0M5 



PCT/tSOl/03455 



ISS 190 195 

His Met Ala Val Val Lys Leu Ala Cys Ser Glu Thr Thr Val Asn 

200 205 210 

Arg ^la Tyr Gly Leu Thr Met Ala Leu Leu Val lie Gly Leu Asp 

: : 5 2 2 0 2 2 5 

Val Leu Ala lie Gly Val Ser Tyr Ala His lie Leu Gin Ala Val 

230 235 240 

Leu Lys Val Pro Gly Ser Glu Ala Arg Leu Lys Ala Phe Ser Thr 

245 250 255 

Cys Gly Ser His He Cys Val He Leu Val Phe Tyr Val Pro Gly 

260 265 270 

He Phe Ser Phe Leu Thr His Arg Phe Gly His His Val Pro His 

275 280 285 

His Val His Val Leu Leu Ala Thr Arg Tv^r Leu Leu Met Pro Pro 

290 295 300 

Ala Leu Asn Pro Leu Val Tyr Gly Val Lys Thr Gin Gin He Arg 

305 310 315 

Gin Arg Val Leu Arg Val Phe Thr Gin Lys Asp 

320 325 



<210> 3 

<211> 315 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7472035CD1 



<400> 3 



Met 


Glu 


Thr 


Pro 


Ala 


Ser 


Phe 


Leu 


Leu 


Val 


Gly 


He 


Pro 


Gly 


Leu 


1 








S 










10 










15 


Gin 


Ser 


Ser 


His 


Leu 


Trp 


Leu 


Ala 


He 


Ser 


Leu 


Ser 


Ala 


Met 


Tyr 










20 








25 










30 


He 


He 


Ala 


Leu 


Leu 


Gly 


Asn 


Thr 


He 


He 


Val 


Thr 


Ala 


He 


Trp 










35 








40 










45 


Met 


Asp 


Ser 


Thr 


Arg 


His 


Glu 


Pro 


Met 


Tyr 


Cys 


Phe 


Leu 


Cys 


Val 








50 










55 










60 


Leu 


Ala 


Ala 


Val 


Asp 
65 


He 


Val 


Met 


Ala 


Ser 
70 


Ser 


Val 


Val 


Pro 


Lys 
75 


Met 


Val 


Ser 


He 


Phe 


Cys 


Ser 


Gly 


Asp 


Ser 


Ser 


He 


Ser 


Phe 


Ser 










80 








85 










90 


Ala 


Cys 


Phe 


Thr 


Gin 


Met 


Phe 


Phe 


Val 


His 


Leu 


Ala 


Thr 


Ala 


Val 








95 










100 










105 


Glu 


Thr 


Gly 


Leu 


Leu 


Leu 


Thr 


Met 


Ala 


Phe 


Asp 


Arg 


Tyr 


Val 


Ala 








110 










115 










120 


He 


Cys 


Lys 


Pro 


Leu 


His 


Tyr 


Lys 


Arg 


He 


Leu 


Thr 


Pro 


Gin 


Val 






125 










130 










135 


Met 


Leu 


Gly 


Met 


Ser 


Met 


Ala 


He 


Thr 


He 


Arg 


Ala 


He 


He 


Ala 








140 










145 










150 


He 


Thr 


Pro 


Leu 


Ser 


Trp 


Met 


Val 


Ser 


His 


Leu 


Pro 


Phe 


Cys 


Gly 










155 








160 










165 


Ser 


Asn 


Val 


Val 


Val 
170 


His 


Ser 


Tyr 


Cys 


Glu 
175 


His 


He 


Ala 


Leu 


Ala 

180 


Arg 


Leu 


Ala 


Cys 


Ala 


Asp 


Pro 


Val 


Pro 


Ser 


Ser 


Leu 


Tyr 


Ser 


Leu 






185 










190 










195 


He 


Gly 


Ser 


Ser 


Leu 


Met 


Val 


Gly 


Ser 


Asp 


Val 


Ala 


Phe 


He 


Ala 








200 










205 










210 


Ala 


Ser 


Tyr 


He 


Leu 


He 


Leu 


Lys 


Ala 


Val 


Phe 


Gly 


Leu 


Ser 


Ser 








215 










220 










22 5 


Lys 


Thr 


Ala 


Gin 


Leu 


Lys 


Ala 


Leu 


Ser 


Thr 


Cys 


Gly 


Ser 


His 


Val 








230 








235 










240 


Gly 


Val 


Met 


Ala 


Leu 


Tyr 


Tyr 


Leu 


Pro 


Gly 


Met 


Ala 


Ser 


He 


Tyr 








245 










250 










255 


Ala 


Ala 


Trp 


Leu 


Gly 


Gin 


Asp 


Val 


Val 


Pro 


Leu 


His 


Thr 


Gin 


Val 








260 










265 










270 


Leu 


Leu 


Ala 


Asp 


Leu 


Tyr 


Val 


He 


He 


Pro 


Ala 


Thr 


Leu 


Asn 


Pro 



3/31 



BNSDOCID cWO 0157085A2 



PCT/l SOl/03455 

WO *n/57085 



280 285 



„e TV. Oly III .ys .'-n ..u «a =lu Ar<, He .rp 

... M« V.1 Pn. «p H.s S„ .eu =W S.. 

i 305 



<210> 4 

<211> 314 

<212> PRT 

<213> Homo sapiens 

<220> 



tTs^i s.r .sn U. T.. ..u T» His Pro Th. P.- 
..i val II. Pr^o olu His ..u H.s xi. T.p li. 

„. P„ P.. CVS L^S .ia T-vr ^r .eu L.u L,u =lv .s„ cvs 

..u I.,u if. II. =i„ Ma Ala Aia ..u Hi. =lo PrJ 

7.1^ z^iA Tie Asp Leu Val Leu 
.«t Tyr L,u PB. l.au M. '-l' "'^ 75 

3.r s.r s.r M. Pro Lvs ^ ^1 

„p .r, Oiu U. 4n P.. P.. Ma cvs ..u M. .in „.r P.. 

HIS s.r P.. sll li. M.. Oiu s.r Ma vai ..u ..u M. M.t 
Ma P.. .sp val M. li. Cvs .v- Pro ... His ^r r 

^, TVS lie Gly Met Ala Ala 

Lys val Leu Thr Gly Ser Leu He Thr Lys y 

val Ala Ar. Ala ifl r.r Leu Me. ... Pro Leu P.o Pne Leu L^u 
cys P.e H.S '^l cys A.. Gly Pro Val He Ala H.s Cys Ty. 
olu HIS Me. ^1° val Val A.. Leu Ala Cys Gly Asp ..r S.r 



Phe 


Asn 


Asn 


He 


Tyr 

200 


Gly 


Leu 


Asp 


Leu 


Leu 


Leu 
215 


Val 


Ala 


Val 


Leu 


Leu 


Leu 
230 


Ala 


Gly 


Thr 


cys 


Val 


Ser 

245 


His 


Thr 


Val 


Val 


He 


ser 
260 


Ser 


Ala 


pro 


His 


Val 


His 

275 


He 


Pr 0 


Pro 


Met 


Val 


Asn 

290 


Pro 


He 


Arg 


Glu 


Ser 


He 

305 


Leu 



135 _ . 

Tyr He Phe He Leu Gin 

-too 

Ala Arg Tyr Lys Ala Phe 
L6U Ala Phe Tyr Thr 

250 

Arg Val Ala Arg His .-.la 

2 6 5 ^ ' ^ 

Asn Phe TVr Leu Leu ?he 

280 

- Gly val Lys Thr Lys Gin 
295 , ^ 



310 



<21C> 5 

<211> 321 

<212> PRT 

<213> Homo sapiens 

<22C> 

<221> misc^feature 



4/31 



3NSDOCID «^WO G157085A2 



w o 01/5'^O85 



<223> Incyte ID No: 7472037CD1 
<4 00> 5 





Ala 


His 


Gin 


Ala 

5 


Pro 


Glu 


Lys 


I 

Leu 


Val 


Thr 


Glu 


Phe 


Leu 


Leu 


Val 










20 








Leu 


Arg 


Pro 


Thr 


Leu 

35 


Phe 


I le 


Leu 


Thr 


Leu 


Ser 


Gly 


Asn 


Ala 


Thr 


He 










50 








Arg 


Thr 


Leu 


His 


Thr 


Pro 


Met 


Tyr 










65 








Leu 


Ser 


Glu 


Thr 


Cys 


Tyr 


Thr 


Leu 










80 








Ala 


His 


Leu 


Leu 


Met 


Glu 


Ser 


Gin 










95 








Arg 


Ala 


Gin 


Met 


Phe 


Phe 


Phe 


Leu 










110 








PhG 


Leu 


Leu 


Thr 


Leu 


Met 


Gly 


Tyr 










12 5 








His 


Pro 


Leu 


Arg 


Tyr 


Ser 


Val 


He 










140 








Cys 


Leu 


Gly 


Ala 


Leu 


Val 


Phe 


Cys 










155 








lie 


Glu 


Thr 


Cys 


Met 


He 


Phe 


Ser 










170 








His 


Val 


Glu 


His 


Phe 


Phe 


Cys 


Asp 










185 








Ser 


Cvs 


Asp 


Glu 


Ser 


Ser 


Leu 


Lys 










200 








Ser 


lie 


Leu 


Val 


Val 


Leu 


Val 


Ser 










215 








Tyr 


Ala 


Phe 


He 


Val 


Ala 


Ala 


He 










230 








Gly 


Arcj 


Arcr 


Lys 


Ala 


Phe 


Ser 


Thr 










245 








Val 


He 


Val 


His 


Phe 


Gly 


Cys 


Ala 










260 








Asp 


Ser 


Gly 


Ala 


Asn 


Pro 


Ser 


Gin 










275 








Tyr 


Thr 


Val 


Val 


Thr 


Pro 


Leu 


Leu 










290 








Arg 


Asn 


Lys 


Glu 


Val 


Arg 


Val 


Ala 










305 








Gly 


Cys 


Gly 


Ala 


Phe 


Lys 














320 









<210> 6 

<211> 331 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7472039CD1 

<400> 6 

Met Ser Pro Asp Gly Asn His Ser 

1 5 
Leu Ala Gly Leu Pro Asn Leu Asn 

20 

Ser Val Phe Leu Leu Val Tyr Leu 
35 

Leu He Val Gly Val Val Arg Ala 
50 

Met Tyr Phe Phe Leu Gly Asn Leu 



Gin 


Gin 
10 


Asp 


Asn 


Gly 


Thr 


Trp 
15 


Gly 


Phe 


Ser 


Asn 


Leu 


Pro 


Glu 


2 5 










3 0 


Phe 


Leu 
40 


Leu 


Thr 


■Tyr 


Leu 


Val 
45 


He 


Thr 
55 


He 


He 


Gin 


Val 


Asp 
60 


Arg 


Phe 

70 


Leu 


Ala 


Val 


Leu 


Ser 
75 


Val 


Thr 
85 


He 


Pro 


Asn 


Met 


Leu 

90 


Ala 


He 

100 


Ser 


He 


Ala 


Gly 


Cys 
105 


Gly 


Leu 

115 


Gly 


Cys 


Ser 


His 


Cys 
120 


Asp 


Arg 

13 0 


Tyr 


Val 


Ala 


He 


Cys 
135 


Met 


Arg 
145 


Pro 


Thr 


Val 


Cys 


Leu 
150 


Ser 


Gly 
160 


Phe 


Ser 


Val 


Ala 


Leu 
165 


Ser 


Pro 
175 


Phe 


Cys 


Gly 


Ala 


Gly 

180 


He 


Ala 
190 


Pro 


Val 


Leu 


Lys 


Leu 
195 


Gly 


Leu 

205 


Gly 


He 


Phe 


Phe 


Leu 
210 


Phe 


Leu 

220 


Phe 


He 


Leu 


Leu 


Ser 
225 


Val 


Arg 

235 


He 


Pro 


Ser 


Ala 


Ser 
240 


Cys 


Ala 


Ala 


His 


Leu 


Thr 


Val 


250 










255 


Ser 


He 

265 


He 


Tyr 


Leu 


Arg 


Pro 
270 


Asp 


Arg 

280 


Leu 


Val 


Ala 


Val 


Phe 
285 


Asn 


Pro 
295 


Val 


Val 


Tyr 


Thr 


Leu 

300 


Leu 


Arg 
310 


Lys 


Asn 


Leu 


Ala 


Arg 
315 



Ser 


Asp 


Pro 


Thr 


Glu 


Phe 


Val 




10 










1 5 


Ser 


Ala 


Arg 


Val 


Glu 


Leu 


Phe 




25 










30 


Leu 


Asn 


Leu 


Thr 


Gly 


Asn 


Val 




40 










45 


Asp 


Thr 


Arg 


Leu 


Gin 


Thr 


Pro 




55 










60 


Ser 


Cys 


Leu 


Glu 


He 


Leu 


Leu 
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70 ^ = 



Thr S„ Val He ill Pro I.ys M.t Leu S.r P.e Mu S,r A„ 

= ™r ne sfr P.e .i, Al. Cy, II. T.. =ln P.. T-.r PJ, 

Tyr Pn. Phe Leu =?y ser clu Ph. L.u Leu Leu M. v.l «e. 

se. M. .sp A., "° 

Leu Leu Met Ser =Iy V.l Cys Ph. Ar| Val Ala Leu Ala Cy, 

„p val Gly =ly "e^ val P« V.l L.u =ly Pro Thr Val Ala V.l 
Ala Leu L.u Pro III cys Lys cl„ oly Ala val v.l =ln His Ph. 
Ph, cys A=P ser lly pro L.u L.u Ar, Leu Al. cys Thr As„ Thr 
Lys Lys L.U Clu III Thr Asp Ph, V.l L.u Al. Ser Leu Val lie 
val ser S.r L.u III n. Thr Al. val s.r Tyr cly L.u 11. val 
L.O Ala val Leu III 11. Pro s.r Al. ser cly Ar, cl„ Lys Al. 
Ph. ser Thr cys HI s.r Hi. L.u He v.l val Thr L.u Ph. J.r 
Oly s.r Ala II. Phi L.u Tyr val Ara P|| cm ser cly s.r 
V.1 ASP Thr AS„ ill Ala V.l Thr v,l II. Thr Thr Phe v.l Thr 
P.„ L,u Leu As„ III Ph. 11. Tvr Al. L.u Arg As„ clu cl„ val 
Lys alU Ala L.u HI Asp H.t Ph. Ar, Ly. V.l Val Al. Cly V,l 
L.U Cly AS„ L.U III Leu Asp Lys cys L.U S.r =lu Lys Al. V.l 



320 

Lys 



<210> 7 
<211> 337 
<212> PRT 

<213> Homo sapiens 



<220> 

<'^T1> misc feature 

<223> incyte ID No: 7472040CD1 

Mt°°Gli Asn ASP ser Val Ser Tyr Glu Tyr Gly Asp ^r Ser Asp 
.ei ser Asp Arg Pro Val Asp Cys Leu Asp Gly Ala Cys Leu Ala 
ne ASP pro Leu .11 Val Ala Pro Leu Pro Leu Tyr Ala Ala .le 
P.e Leu val Gly Va! Pro Gly Asn Ala Me. Val Ala Trp Val Ala 
Oly Lys val Ala Arg Arg Arg Val Gly Ala Thr Trp Leu Leu H.s 
.eu Ala val Ala Asp Leu Leu Cys Cys Leu Ser Leu Pro He Leu 
.la val pro lie A^la Arg Gly Gly His Trp Pro Tyr Oly Ala Val 
,,y cys Arg Ala L^u Pro Ser He lie Leu Leu Thr Me. ^r Ala 
ser val Leu Leu ^eu Ala Ala Leu Ser Ala Asp Leu Cys Phe Leu 
.la Leu Gly Pro III Trp Trp Ser Thr Val Gin Arg Ala Cys Gly 
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Va. 1 


Gin 


Val 


Ala 


Cys 


G i y 


Ala 


Ala 


Trp 










155 










Thr 


Val 


Pro 


Ser 


Ala 

I'^O 


I le 


Tyr 


Arg 


Arg 


Pro 


Ala 


Arg 


Leu 


Gin 


Cys 


Val 


Val 


Asp 










185 










Thr 


Glu 


A.sn 


Ala 


Val 


Thr 


Ala 


He 


Arg 










200 










G 1 V 


Pro 


Leu 


Val 


Ala 


Val 


Ala 


Ser 


Cys 








215 












Ala 


Ala 


Arg 


Arg 


Cys 


Arg 


Pro 


Leu 










230 










Glv 


PhG 


Phe 


Val 


Cys 


Trp 


Ala 


Pro 


Tyr 










245 










Leu 


Thr 


Val 


Ala 


Ala 


Pro 


Asn 


Ser 


Ala 










260 










Arg 


Ala 


Glu 


Pro 


Leu 


lie 


Val 


Gly 


Leu 










275 










Lou 


As n 


Pro 


Met 


Leu 


Phe 


Leu 


Tyr 


Phe 










290 










Arg 


Ser 


Leu 


Pro 


Ala 


Ala 


Cys 


His 


Trp 








305 










Gly 


Gin 


Asp 


Glu 


Ser 


Val 


Asp 


Ser 


Lys 
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Leu 


Val 


Ser 


Glu 


Met 


Glu 


Val 














335 











PCT/IS01/(M455 



145 










150 


Thr 


Leu 


Ala 


Leu 


Leu 


Leu 


160 










165 


Leu 


His 


Gin 


Glu 


His 


Phe 


1^5 










ISC 


Tyr 


Gly 


Gly 


Ser 


Ser 


Ser 


190 










195 


Phe 


Leu 


Phe 


Gly 


Phe 


Leu 


2 05 










210 


His 


Ser 


Ala 


Leu 


Leu 


Cys 


220 










2 2 5 


Gly 


Thr 


Ala 


He 


Val 


Val 


235 










240 


His 


Leu 


Leu 


Gly 


Leu 


Val 


250 










255 


Leu 


Leu 


Ala 


Arg 


Ala 


Leu 


265 










270 


Ala 


Leu 


Ala 


His 


Ser 


Cys 


280 










285 


Gly 


Arg 


Ala 


Gin 


Leu 


Arg 


295 










300 


Ala 


Leu 


Arg 


Glu 


Ser 


Gin 


310 










315 


Lys 


Ser 


Thr 


Ser 


His 


Asp 


325 










330 



<210> 8 

<211> 1473 

<212> PRT 

<2 1 3 > Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4250893CD1 



<400> 8 



Met 


Ala 


Gly 


Gly 


Ala 


Trp 


Gly 


1 








5 






Leu 


Lys 


Lys 


Glu 


Glu 
20 


Leu 


Lys 


Lys 


Ala 


His 


Ser 


Arg 


Ser 


Ser 








35 






Glu 


Lys 


Thr 


Ser 


Gly 


Met 


Glu 








50 






Tyr 


Gly 


Glu 


Gin 


Arg 

65 


Ala 


Trp 


Gin 


Met 


Gly 


Leu 


Arg 

80 


Ser 


Leu 


Gly 


His 


Ser 


Pro 


Ser 


Phe 


Pro 








95 






Gly 


Ser 


Pro 


Ser 


Gin 


Pro 


Thr 








110 






He 


His 


Glu 


Leu 


Pro 
125 


Ala 


Gly 


Val 


Leu 


Arg 


Gin 


Leu 
140 


Pro 


Asp 


He 


Ser 


Ala 


Ser 


His 
155 


Val 


Tyr 


His 


Glu 


Ser 


Pro 


Ser 
170 


Gin 


Glu 


Ala 


Val 


Leu 


Gly 


Ser 
185 


Trp 


Gly 


Pro 


Arg 


Glu 


Gin 


Glu 

200 


Ala 


Pro 


Thr 


Ser 


Gly 


He 


Tyr 


Tyr 


Thr 



Arg 


Leu 


Ala 


cys 


Tyr 


Leu 


Glu 


Phe 






10 










15 


Glu 


Phe 


Gin 


Leu 


Leu 


Leu 


Ala 


Asn 






25 










30 


Ser 


Gly 


Glu 


Thr 


Pro 


Ala 


Gin 


Pro 




40 










45 


Val 


Ala 


Ser 


Tyr 


Leu 


Val 


Ala 


Gin 






55 










60 


Asp 


Leu 


Ala 


Leu 


His 


Thr 


Trp 


Glu 




70 










75 


Cys 


Ala 


Gin 


Ala 


Gin 


Glu 


Gly 


Ala 




85 










90 


Tyr 


Ser 


Pro 


Ser 


Glu 


Pro 


His 


Leu 




100 










105 


Ser 


Thr 


Ala 


Val 


Leu 


Met 


Pro 


Trp 






115 










120 


Cys 


Thr 


Gin 


Gly 


Ser 


Glu 


Arg 


Arg 




130 










135 


Thr 


Ser 


Gly 


Arg 


Arg 


Trp 


Arg 


Glu 






145 










150 


Gin 


Ala 


Leu 


Pro 


Ser 


Ser 


Pro 


Asp 






160 










165 


Ser 


Pro 


Asn 


Ala 


Pro 


Thr 


Ser 


Thr 






175 










180 


Ser 


Pro 


Pro 


Gin 


Pro 


Ser 


Leu 


Ala 






190 










195 


Gly 


Thr 


Gin 


Trp 


Pro 


Leu 


Asp 


Glu 




205 










210 


Glu 


He 


Arg 


Glu 


Arg 


Glu 


Arg 


Glu 
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7 s 

220 



I^ys ser Clu Lys cly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
P.o pro Cln Ala kIs Thr ser Leu Gin Pro Hxs H.s Hxs Pro Trp 
Clu pro Ser Val frg Olu Ser Leu cys Ser Thr Trp Pro Trp Lys 
.s^ Glu ASP Phe A.Jn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin 
..g pro HIS pro Arg ser Gin Asp Pro Leu Val Lys Arg ser Trp 
pro ASP Tyr Val Glu Glu Asn Arg Gly H.s Leu He Glu He Arg 
^sp Leu Phe Gly III Gly Leu Asp Thr Gin Glu Pro Arg lie Val 
Xle Leu Gin Gly Ma Ala Gly lie Gly Lys Ser Thr Leu Ala Arg 
om val Lys Glu iia Trp Gly Arg Gly Oln Leu Tyr Gly Asp Arg 
Phe Gin His val Phe Tyr Phe Ser Cys Arg Glu Leu Ala Gin Ser 
.ys val val ser Leu Ala Glu Leu He Gly Lys Asp Gly Thr Ala 
Thr pro Ala Pro III Arg Gin He Leu Ser Arg Pro Glu Arg Leu 
.eu Phe Xle Leu ^sp Gly Val Asp Glu Pro Gly Trp Val Leu Gin 
Olu pro ser Ser c\u Leu Cys Leu H.s Trp Ser Gin Pro Gin Pro 
Ma ASP Ala Leu S Gly Ser Leu Leu Gly Lys Thr He Leu Pro 

, Ai= a^n TVir Thr Ala Leu Gin Asn 

Glu Ala ser Phe Leu He Thr Ala Arg Thr Tnr a 

.eu He pro Ser tH Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
P.e ser Glu Ser III Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr 
.sp Glu Arg Gin til He Arg Ala Phe Arg Leu Val Lys Ser Asn 
.ys Glu Leu Trp HI Leu Cys Leu Val Pro Trp Val Ser Trp Leu 
.la cys Thr Cys L^u Met Gin Gin Met Lys Arg Lys Glu Lys Leu 
Xhr Leu Thr Ser Lys Thr Thr Thr Thr Leu Cys Leu Hxs Tyr Leu 
.la Gin Ala Leu III Ala Gin Pro Leu Cly Pro Gin Leu Arg Asp 
.eu cys ser Leu Ma Ala Glu Gly He Trp Gin Lys Lys Thr Leu 
P.e ser Pro Asp Asp Leu Arg Lys His Gly Leu Asp Gly Ala lie 
X.e ser Thr Phe llu Lys Met Gly He Leu Gin Glu His Pro lie 
P.O Leu ser Tyr HI Phe He His Leu cys Phe Gin Glu Phe Phe 
Ma Ala Met Ser Val Leu Glu Asp Glu Lys Gly Arg Gly Lys 

„,3 ser Asn Cys fie He Asp Leu Glu Lys Thr Leu Glu Ma Tyr 
Oly He H.s Gly lIu Phe Gly Ala Ser Thr Thr Arg Phe Leu Leu 
Oly Leu Leu Ser Asp Glu Gly Glu Arg Olu Met Glu Asn He Phe 
H.s cys Arg Leu ^ Gin Gly Arg Asn Leu Met Gin Trp Val Pro 
ser Leu Gin Leu t2 Leu Gin Pro His Ser Leu Glu Ser Leu His 
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'- J ^ 


Leu 


Tvr 


Glu 


Thr 

7 2 5 


A T n 


P*s n 


Lvs 


Thr 


Phe 

7 3 0 


Leu 


Thr 


Gin 


Val 


Met 

735 


A 1b. 


His 


Phe 


Glu 


Glu 


Met 


civ 


Me t 


Cvs 


Val 


Glu 


Thr 


^ 3 D 


Met 


Glu 










7 4 0 










745 










75C 


Leu 


Leu 


Val 


Cvs 


Thr 


Phe 


Cvs 

y ~j 


lie 


' vs 


Phe 


Ser 


Ara 


His 


\-' a 1 


Lvs 










755 










760 










765 


^ y 


Leu 


Gin 


Leu 


lie 


Glu 


G ' V 


r a 


Gin 


His 


Ar g 


Ser 


Thr 


Trc 


Ser 










77 0 










775 










730 


Pro 


Thr 


Met 


Va 1 


Va 1 


Leu 


Phe 


Ar g 


Trn 


Val 


Pro 


Val 


Thr 


Asp 


Ala 










78 5 










790 










795 


i y I 


'T*'m 
i rp 


Gin 


lie 


Leu 


Phe 


Ser 


Va 1 


Leu 


Lys 


Val 


Thr 


Arg 


Asn 


Leu 










800 










805 










810 


Lys 


Glu 


Leu 


Asp 


Leu 


Ser 


G Iv 


As n 


Ser 


Leu 


Ser 


His 


Ser 


Ala 


Val 










815 










82 0 










82 5 


Lys 


^ tr X 


Leu 


*^y is 


Lys 


Thr 


Leu 


Ar g 


Ar g 


Pro 


Ar g 


Cvs 

v_ y ^ 


Leu 


Leu 


Glu 










83 0 










83 5 










840 


Thr 


Leu 


A r 5 


Leu 


Ala 


^y 


Cvs 


Gl V 

-L y 


Leu 


Thr 


Ala 


Glu 


As p 


Cvs 

v^y o 


Lys 










84 5 










850 










855 


Asp 


eu 


Ala 


Phe 


Gly 


Leu 


Ar 


Ala 


Asn 


Gin 


Thr 


L eu 


Thr 


Glu 


Leu 








8 6 0 










865 










87 0 


sp 


Leu 


S e r 


Phe 


Asn 


Va 1 


Leu 


Thr 


Asp 


Ala 


Glv 


Ala 


Lys 


His 


Leu 








87 5 










880 










88 5 


cy s 


Gin 


rt X y 


L eu 


Ar 5 


Gin 


Pro 


Ser 


Cvs 

^wy i> 


Lys 


Leu 


Gin 


Ar g 


Leu 


Gin 










8 9 0 










895 










9 00 


Leu 


V d X 


O fcr X 


L,ys 


Gly 


Leu 


Thr 


Ser 


Asp 


Cvs 


Cvs 
k_y t> 


Gin 


Asp 


Leu 


Ala 










Q 0 

U J 










910 










915 


o er 


V ct X 


Leu 


s e r 


rt. X <a 


o 
o ^x 


Pro 


Ser 


L eu 


Lys 


Glu 


Leu 


Asp 


Leu 


Gin 










9 2 0 










92 5 










9 3 0 




Asn 


Asn 


Leu 


Asp 


Asp 


V d X 


Gly 


Val 


Ar 


Leu 


Leu 


Cv <=; 
L,y :=> 


Glu 


Gly 










9 3 5 










94 0 










945 


Leu 


Ar 5 


n X o 


r^x iw* 


ri X d 


(^ys 


Lys 


Leu 


lie 


Ar ^ 


Leu 


Gly 


Leu 


Asp 


Gin 










9 5 0 










9 55 










9 60 


Trir 


i ilX 


L eu 


C i2> "T* 

O tr X 


Asp 


X 


Met 


rtx y 


Gin 


Glu 


Leu 


Ar g 


Ala 


Leu 


Glu 










y \j 










9 7 0 










97 5 




XU 


Lys 


Pro 


\j ±11 


Leu 




lie 


Phe 


S er 


Ar^ 


A r g 


Lys 


Pro 


Ser 










Q ft n 

J O \J 










9 85 










9 9 0 


V a. X 


l^ifcr L. 


X liX 


X w 


X 1 i X 


Glu 


Gly 


Leu 


Asp 


Thr 


Gly 


Glu 


Me t 


Ser 


Asn 










Q q S 
y y ^ 










10 0 0 










10 05 


" er 


Thr 


S er 


Ser 


Leu 


Lys 


A r Q 


Gin 


Ar 5 


Leu 


Glv 

vj xy 


Ser 


Glu 


Arg 


Ala 










1010 










1015 










1020 




S er 


His 


V d X 


Ala 


Gin 


Ala 


As n 


Leu 


Lys 


Leu 


Leu 


Asp 


Val 


Ser 










102 5 










103 0 










103 5 


Lys 


lie 


Phe 


Pro 


lie 


Ala 


Glu 


lie 


Ala 


Glu 


Glu 


Ser 


Ser 


Pro 


Glu 








104 0 










1045 










1050 


Va. 1 


Val 


Pro 


Val 


Glu 


Leu 


Leu 


^ y ^ 


Val 


Pro 


Ser 


Pro 


Ala 


Ser 


Gin 










1055 








1060 










1065 


o ly 


Asp 


Leu 


His 


Thr 


Lys 


Pro 


Leu 


Glv 

xy 


Thr 


Asp 


Asp 


Asp 


Phe 


Trn 










107 0 










1075 










1080 


xy 


Pro 


Thr 


Gly 


Pro 


Val 


Ala 


Thr 


Glu 


Val 


Val 


Asp 


Lys 


Glu 


Lys 








108 5 










109 0 










1095 


AS n 


Leu 


X y X 


x^i. y 


Va 1 


His 


Phe 


Pro 


Va 1 


Ala 


Glv 

»^ X y 


Ser 


Tvr 

J. y X 


Arg 


TrD 










110 0 










1105 










1110 


Pro 


As n 


Thr 


Gly 


Leu 




Phe 


Val 


Met 


Arg' 


Glu 


Ala 


Val 


Thr 


Val 










1115 










1120 










1125 


G 1 u 


lie 


Glu 


Phe 


Cvs 

v_ y ^ 


Val 


TrD 
i. 1. ^ 


As p 


Gin 


Phe 


Leu 


Glv 


Glu 


lie 


Asn 










113 0 










1135 










1140 


Pro 


Gin 


His 


Ser 


T^D 


Met 


Val 


Ala 


Glv 


Pro 


Leu 


Leu 


Asp 


lie 


Lys 










114 5 










1150 










1155 


Ala 


Glu 


Pro 


Gly 


Ala 


Val 


Glu 


Ala 


Val 


His 


Leu 


Pro 


His 


Phe 


Val 








1160 










1165 










1170 


Ala 


Leu 


Gin 


Gly 


Gly 


His 


Val 


Asp 


Thr 


Ser 


Leu 


Phe 


Gin 


Met 


Ala 










1175 










1180 










1185 


His 


Phe 


Lys 


Glu 


Glu 


Gly 


Met 


Leu 


Leu 


Glu 


Lys 


Pro 


Ala 


Arg 


Val 










1190 










1195 










1200 


Glu 


Leu 


His 


His 


lie 


Val 


Leu 


Glu 


Asn 


Pro 


Ser 


Phe 


Ser 


Pro 


Leu 










1205 










1210 










1215 


Gly 


Val 


Leu 


Leu 


Lys 


Met 


lie 


His 


Asn 


Ala 


Leu 


Arg 


Phe 


He 


Pro 
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1 7 9 S ^^^^ 
val Thr Ser Val V^l Leu Leu T^yr H.s^Arg Val His Pro Glu_^G.u 

Val Thr Phe H.s^Leu Tyr L.u He Preiser Asp Cys Ser Ile^Arg 

.ys Ala lie Asp'isp Leu Clu Met Lys ^Phe Gin Phe Val Argjle 

H.s Lys pro Pro'pro Leu Thr Pro Leu^Tyr Met Oly Cys Arg^Tyr 

Thr val ser Gly'seJ Gly Ser Gly Met^Leu Glu He Leu Pro^Lys 

Olu Leu Glu Leu'cjs Arg Ser Pro^Gly Glu Asp Gin Leu^Phe 

ser Glu ser Tyr'vi? Gly His Leu Gly Ser Gly He Arg Leu^Gln 

val Lys ASP Lys\ys Asp Glu Thr Leu Jal Trp Glu Ala Leu^Val 
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<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> incyte ID No: 6726656CD1 

MeJ^Lys Leu Trp Met Glu Ser His Leu lie Val Pro Glu Thr Arg 
pro ser Pro Arg Met Met Ser Asn Gin Thr Leu Val Thr Glu Phe 
He Leu Gin Gly pSe Ser Glu His Pro Glu Tyr Arg Val Phe Leu 
P.e ser cys Phe Leu Phe Leu Tyr Ser Gly Ala Leu Thr Gly Asn 
val Leu He Thr Leu Ala He Thr Phe Asn Pro Gly Leu His Ala 
pro Met Tyr Phe Phe Leu Leu Asn Leu Ala Thr Met Asp He H^e 
^.,3 ser ser l!e Met Pro Lys Ala Leu Ala Ser Leu Val Ser 

nu Glu ser ser l!e Ser Tyr Gly Gly Cys Met Ala Gin Leu ^yr 



Phe Leu Thr Trp Ala Ala Ser Ser Glu Leu Leu Leu Leu Thr Val 
Met Ala Tyr Asp Arl Tyr Ala Ala He Cys His Pro Leu Hxs Tyr 
ser ser Met Met Ser Lys Val Phe Cys Ser Gly Leu Ala Thr Ala 

10/31 



BNSDOClD <WO =1570e5A2 I > 



wo 01/5^085 



PCT/tS01/(^3455 











1 c; c 
1 J 










1 O ^' 










16 5 


V a. 1 


Trp 




Lj fcr L*. 


\^ 3 


.--i. 1 a 




:r3 » 1 


i n r 




T 


His 




^^ly 






























1 O J 




T 


A r 5 


Leu 






v^ys 


r 1 V 


Pro 




Va 1 


lie 


lie 


His 


Phe 










1 o D 










± y u 










19 5 






G 1 u 


Va 1 


■p y- /-N 


D r- ,— 

ir r O' 


Lj t= ^ 


r 


Leu 


Leu 

0 


Ser 


Cvs 

u y i. 


Ser 


Ser 


Thr 
^ 1 


^yr 


»' ci ± 


A s n 


i. y 


2 G 0 

V a. 1 






v/^ 1 
V d i 


Leu 


i- J D 


Asp 


Ala 


Phe 


T\/T 

1 y L 


J. y 


















"* 7 n 

A, Z U 










2 2 5 




Ve- 1 


A s n 




Leu 




Thr 


i X 


t\ Lex 




X y L 


Gly 


Phe 


He 


Va 1 










7 n 

A- J U 










^ 

^ w ^ 










2 4 0 




s e r 


1 ± e 


Leu 


Liys 


va 1 


Lys 


* nr 




Trp 


^ X y 


Arg 


Gin 


ijy ^ 


Ala 










/ ^ 








^ n 

i. -I u 










2 55 






Tnr 


cys 


b er 


C z:^ >- 

o eiT 


n 1 S 


Leu 


i 1 1 1. 


V d X 


V d X 




Me t 


i y L 


i y L 








A Pi 
^ D U 










2 6 5 










27 0 


^ n r 


1 a 


V d, 1 


"DV-1 ^ 

-rne 


Tyr 


Zl 1 54 
/\ 1 a 




^ X tr 


o *r i 




Val 


Ser 


n 1 w 

J. y 


y X 


Ser 










275 








28C 










285 


Ala 


Gly 


Lys 


Ser 


Lys 


Leu 


Ala 


Gly 


Leu 


Leu 


Tyr 


Thr 


Val 


Leu 


Ser 






290 










295 










300 


Pro 


Thr 


Leu 


Asn 


Pro 


Leu 


He 


Tyr 


Thr 


Leu 


Arg 


Asn 


Lys 


Glu 


Val 










305 










310 










315 


Lys 


Ala 


Ala 


Leu 


Arg 


Lys 


Leu 


Phe 


Pro 


Phe 


Phe 


Arg 


Asn 












320 










325 













<210> 10 

<211> 384 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7472062CD1 

< 4 0 0 > 10 



Met 


Asn 


Val 


Leu 


Leu 


Ala 


Asp 


Ser 


Asn 


Ser 


Asn 


Lys 


Lys 


He 


Val 


1 








5 










10 










15 


His 


Lys 


His 


He 


Cys 


Ser 


Leu 


Gin 


Ser 


Ala 


Pro 


Lys 


Thr 


Thr 


Asn 








20 










25 










30 


Leu 


Gin 


Pro 


Ser 


He 

35 


Ser 


Asp 


He 


Leu 


Leu 

40 


Ser 


Val 


Glu 


Ser 


Asn 
45 


Asp 


Arg 


Lys 


Asn 


Val 


Ser 


Lys 


He 


Lys 


Gly 


Asp 


Cys 


Phe 


Asn 


Thr 








50 










55 










60 


Arg 


Val 


Ser 


Cys 


Asp 
65 


Ser 


Lys 


He 


Thr 


Ser 

70 


Met 


Glu 


Asn 


Asn 


Thr 

75 


Glu 


Val 


Ser 


Glu 


Phe 
80 


He 


Leu 


Leu 


Gly 


Leu 
85 


Thr 


Asn 


Ala 


Pro 


Glu 
90 


Leu 


Gin 


Val 


Pro 


Leu 
95 


Phe 


He 


Met 


Phe 


Thr 
100 


Leu 


He 


Tyr 


Leu 


He 
105 


Thr 


Leu 


Thr 


Gly 


Asn 


Leu 


Gly 


Met 


He 


He 


Leu 


He 


Leu 


Leu 


Asp 








110 








115 










120 


Ser 


His 


Leu 


His 


Thr 


Pro 


Met 


Tyr 


Phe 


Phe 


Leu 


Ser 


Asn 


Leu 


Ser 










125 








130 










135 


Leu 


Ala 


Gly 


He 


Gly 


Tyr 


Ser 


ser 


Ala 


Val 


Thr 


Pro 


Lys 


Val 


Leu 








140 








145 










ISO 


Thr 


Gly 


Leu 


Leu 


He 


Glu 


Asp 


Lys 


Ala 


He 


Ser 


Tyr 


Ser 


Ala 


Cys 








155 








160 










165 


Ala 


Ala 


Gin 


Met 


Phe 


Phe 


Cys 


Ala 


Val 


Phe 


Ala 


Thr 


Val 


Glu 


Asn 










170 








175 










180 


T-/r 


Leu 


Leu 


Ser 


Ser 


Met 


Ala 


Tyr 


Asp 


Arg 


Tyr 


Ala 


Ala 


Val 


Cys 








185 










190 










195 


Asn 


Pro 


Leu 


His 


Tyr 

200 


Thr 


Thr 


Thr 


Met 


Thr 

205 


Thr 


Arg 


Val 


cys 


Ala 
210 


cys 


Leu 


Ala 


He 


Gly 


Cys 


Tyr 


Val 


He 


Gly 


Phe 


Leu 


Asn 


Ala 


Ser 








215 








220 










225 


: le 


Gin 


He 


Gly 


Asp 


Thr 


Phe 


Arg 


Leu 


Ser 


Phe 


cys 


Met 


Ser 


Asn 








230 










235 










240 


Val 


He 


His 


His 


Phe 


Phe 


Cys 


Asp 


Lys 


Pro 


Ala 


Val 


He 


Thr 


Leu 



1 1/31 



BNSDOCID <WC :i57'::>e5A2 1 > 



PCT/LiSOl/03455 

W O OW5'^(»85 

Thr cys ser Glu Ly^ H.s He Ser Glu Leu lie Leu Val Leu lie 
ser ser Phe Asn VaJ Phe Phe Ala Leu Leu Val Thr Leu He Ser 
Leu Phe lie III He Thr He Leu Lys Arg Hxs Thr Gly Lys 
^.ly Tyr Gin Lys Pro Leu Ser Thr Cys Gly Ser H.s Leu He .la 
He Phe Leu Phe ^r He Thr Val He He Met Tyr He Arg Pro 
ser ser Ser His Ser Mec Asp Thr Asp Lys He Ala Ser Val Phe 
Tyr Thr Met He "le Pro Met Leu Ser Pro He Val Tyr Thr Leu 
Arg Asn Lys Asp Jal Lys Asn Ala Phe Met Lys Val Val Glu Ly. 

Ala Lys Tyr Ser Leu Asp Ser Val Phe 

380 

<:::io> 11- 

1 1 1 > 419 
-'. 1 2 > PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7472067CD1 

;rt°Lei\la Ala Ala Phe Ala Asp Ser Asn Ser Ser Ser Met Asn 
Va! ser Phe Ala Hxs Leu His Phe Ala Gly Gly Tyr Leu Pro Ser 
ASP ser Gin Asp Trp Arg Thr He He Pro Ala Leu Leu Val Ala 
val cys Leu Val G^ly Phe Val Gly Asn Leu Cys Val He Gly H^e 
Leu Leu H.s Asn A^la Trp Lys Gly Lys Pro Ser Met He H.s Ser 
Leu He Leu Asn L^u Ser Leu Ala Asp Leu Ser Leu Leu Leu Phe 
ser Ala Pro He Arg Ala Thr Ala Tyr Ser Lys Ser Val Trp Asp 
Leu Gly Trp Phe Val Cys Lys Ser Ser Asp Trp Phe He His Thr 
cys Met Ala Ala '.ys Ser Leu Thr He Val Val Val Ala Lys Val 
cys Phe Met Tyr III Ser Asp Pro Ala Lys Gin Val Ser He Hxs 
.sn TVr Thr He ser Val Leu Val Ala He Trp Thr Val Ala 

se^ Leu Leu Pro Leu Pro Glu Trp Phe Phe Ser Thr He Arg Hxs 
H.s Glu Gly val ^ Met Cys Leu Val Asp Val Pro Ala Val Ala 
aiu Glu Phe Met Ser Met Phe Gly Lys Leu Tyr Pro Leu Leu Ala 
Phe Gly Leu Pro Leu Phe Phe Ala Ser Phe T-yr Phe Trp Arg Ala 
-yr ASP Gin Cys Lys Lys Arg Gly Thr Lys Thr Gin Asn Leu Arg 
Asn Cln He Arg "ser Lys Oln Val Thr Val Met Leu Leu Ser He 
Ala He He Ser fla Leu Leu Trp Leu Pro Glu Trp Val Ala Trp 
Leu Trp val Trp h'is Leu Lys Ala Ala Oly Pro Ala Pro Pro Gin 

12/31 



BNSDOClD ^WC . 0157OB5A2 



wo (M 5^0H5 



PCT/lSi>l/(>3455 











275 










2 8 0 










235 




Phe 


I le 


Ala 


Leu 


Ser 


Gin 


Va 1 


Leu 


Met 


Phe 


Ser 


He 


Ser 


Ser 










290 










2 9 5 










3 00 


Ala 


As n 


Pro 


Leu 


He 


Phe 


Leu 


Val 


Met 


Ser 


Glu 


Glu 


Phe 


Arg 


Glu 










3 0 5 










3 10 










315 


- - , . 
^ J, 


Leu 


L vs 


Gly 


Val 


TrD 


Lys 


Trp 


Me:: 


He 


Thr 


Lys 


Lys 


Pro 


Pro 










320 










325 










330 


Tri r 


Val 


Ser 


Glu 


Ser 


Gin 


Glu 


Thr 


Pre 


Ala 


Gly 


Asn 


Ser 


Glu 


Gly 










3 35 










340 










345 


Lgu 


Pro 


Asp 


Lvs 


Val 


Pro 


Ser 


Pre 


Glu 


Ser 


Pro 


Ala 


Ser 


He 


Pro 








350 










355 










360 


Glu 


Lys 


Glu 


Lys 


Pro 


Ser 


Ser 


Pro 


Ser 


Ser 


Gly 


Lys 


Gly 


Lys 


Thr 










3 65 










370 










375 


Glu 


Lys 


Ala 


Glu 


He 


Pro 


He 


Leu 


Pro 


Asp 


Val 


Glu 


Gin 


Phe 


Trp 








380 










385 










390 


His 


Glu 


Arg 


Asp 


Thr 


Val 


Pro 


Ser 


Val 


Gin 


Asp 


Asn 


Asp 


Pro 


He 










395 










400 










405 


Pro 


Trp 


Glu 


His 


Glu 


Asp 


Gin 


Glu 


Thr 


Gly 


Glu 


G ly 


Val 


Lys 












4 10 










415 












<2 10> 12 


























<211> 314 


























<2 12> PRT 


























<2 13 > Homo sapiens 






















<220> 




























<221> misc_f eature 






















<223> Incyte ID 


IMO . 


7472072CD1 
















<400> 12 


























Met 


Gly 


Asp 


Val 


AS n 


Gin 


Ser 


Val 


Ala 


Ser 


Asp 


Phe 


He 


Leu 


Val 


1 






5 










10 










15 


Gly 


Leu 


Phe 


Ser 


His 


Ser 


Gly 


Ser 


Arg 


Gin 


Leu 


Leu 


Phe 


Ser 


Leu 








20 










25 










3 0 


Val 


Ala 


Val 


Met 


Phe 


Val 


He 


Gly 


Leu 


Leu 


Glv 


Asn 


Thr 


Val 


Leu 










35 










40 










45 


Leu 


Phe 


Leu 


He 


Arg 


Val 


Asp 


Ser 


Arg 


Leu 


His 


Thr 


Pro 


Met 


Tvr 










50 










55 










60 


Phe 


Leu 


Leu 


Ser 


Gin 


Leu 


Ser 


Leu 


Phe 


Asp 


He 


Glv 


Cvs 


Pro 


Met 










65 










70 










75 


Val 


Thr 


He 


Pro 


Lys 


Met 


Ala 


Ser 


Asp 


Phe 


Leu 


Arg 


Glv 


Glu 


Gly 










80 










85 










90 


Ala 


Thr 


Ser 


Tyr 


Gly 


Gly 


Gly 


Ala 


Ala 


Gin 


He 


Phe 


Phe 


Leu 


Thr 










95 










100 










105 


Leu 


Met 


Gly 


Val 


Ala 


Glu 


Gly 


Val 


Leu 


Leu 


Val 


Leu 


Met 


Ser 


Tyr 








110 










115 










120 


Asp 


Arg 


Tyr 


Val 


Ala 


Val 


Cys 


Gin 


Pro 


Leu 


Gin 


Tyr 


Pro 


Val 


Leu 










125 










130 










135 


Met 


Arg 


Arg 


Gin 


Val 


Cys 


Leu 


Leu 


Met 


Met 


Glv 


Ser 


Ser 


Trp 


Val 










140 










145 










150 


Val 


Gly 


Val 


Leu 


Asn 


Ala 


Ser 


He 


Gin 


Thr 


Ser 


He 


Thr 


Leu 


His 








155 










160 










165 


Phe 


Pro 


Tyr 


Cys 


Ala 


Ser 


Arg 


He 


Val 


Asp 


His 


Phe 


Phe 


Cys 


Glu 










170 










175 










180 


Val 


Pro 


Ala 


Leu 


Leu 


Lys 


Leu 


Ser 


Cvs 


Ala 


Asp 


Thr 


Cys 


Ala 


Tyr 










185 










190 










195 


Glu 


Met 


Ala 


Leu 


Ser 


Thr 


Ser 


Gly 


Val 


Leu 


He 


Leu 


Met 


Leu 


Pro 










200 










205 










210 


Leu 


Ser 


Leu 


He 


Ala 


Thr 


Ser 


Tyr 


Gly 


His 


Val 


Leu 


Gin 


Ala 


Val 










215 










220 










2 2 5 


Leu 


Ser 


Met 


Arg 


Ser 


Glu 


Glu 


Ala 


Arg 


His 


Lys 


Ala 


Val 


Thr 


Thr 










230 










235 










240 


Cys 


Ser 


Ser 


His 


He 


Thr 


Val 


Val 


Gly 


Leu 


Phe 


Tyr 


Gly 


Ala 


Ala 








245 










250 










255 


Val 


Phe 


Met 


Tyr 


Met 


Val 


Pro 


Cys 


Ala 


Tyr 


His 


Ser 


Pro 


Gin 


Gin 










260 










265 










270 


Asp 


Asn 


Val 


Val 


Ser 


Leu 


Phe 


Tyr 


Ser 


Leu 


val 


Thr 


Pro 


Thr 


Leu 



13/31 



BNSDOC'D <WO 



015708 6A2 



PCT/l S(M/OJ455 

WO 01/57085 



280 285 



P.O .eu XXe V/r Ser Leu Ar. Asn Pro CIu Va. Trp Met Ma 
.eu Va. .ys Val ^.^^ Ser ^r. Ma O.y .eu .r. C.n Me. C.s 



305 

<:;10> 13 
<211> 254 
<212> PRT 

<113> Homo sapiens 
<220> 

'->21> misc_featurG 



rrc"l"se. Ar, Tyr Val .1. val =ly «et He L.u s,. =ln ... 
O.y Val P.e S„ V.l ..u ..u „U Ty. 

P.e xy. cy, T =xy cy. .-u s„ T.. «P - 

.y= H.s llJ v.. M. P.. L» ^f, 

,1, pro 4? M« M. ^ 

... Cly .ys P.. ^1 - 

.y V. s.. 3. r.. cy, ... V. P., ^ 

9 5 ^ rp^r^ Ala Lvs Lgu Lys 

v.l II. val s.r s.r Ar, Ws s.t «| Trp Al. y 

.y. P,o ij^^ V. =.y v.. ... ..u cy. 

Xle V.1 cy. M.t i~ V.1 ..n H. H. -| "» ^' 155 

,ly .ys «P ^? T« .=n XI. -J ..n Olu »P ..u -V 

J cy. s.. oiy i!y c=.y - ;i| fa? 

17 0 ^ r o^7c= Ten Glv Leu Met 

T^n q^r Phe Pro ASP val Leu Cys Leu c.xy 
Ala Met Leu Leu Ser pne t^J- ±^ i^d 

..p se. S s„ v.. cy. xje ..u H.. 

... =i„ ri ?K "° 7: i 

P„ =1„ «„ i^l X.. Cln S.. n| V- 

. c-^^ Px/c; Leu Phe Gin Val 
Thr Phe val Ser Ser Tyr Thr Leu Ser cys 



245 

<210> 14 
<211> 362 
<212> PRT 

<213> Homo sapiens 
<220> 



TS^'STo: 747Z0,7CD1 

S^e?°;.r\y. A=P cy. XL oxu ser T., .xy »P Tyr P« L.u .y. 

Cyi .sp M. ox. Cly pro Trp OXy XXe XI. LeU =XU S.r L.u 

... oxy xxe vfx V.X XX. ..u ... ... - - ^ 



35 

14/31 



BNSOOCID <W0 0157065A2 



wo (M 5^085 



PCT/LS(» 1/03455 





Leu 


Met 






lie 


Gin 


Asp 


Cvs 


Ser 


Gin 


Trp 


Asn 


Val 


Leu 










5 0 










5 5 










6 0 


?r c 


Thr 


Gin 


Leu 


Leu 


Phe 


Leu 


Leu 


Ser 


Val 


Leu 


Gly 


Leu 


Phe 


Gly 










65 










70 










75 


Li eu. 


Ala 


Phe 


Ala 


Phe 


lie 


He 


Glu 


Leu 


Asn 


Gin 


Gin 


Thr 


Ala 


Pro 










30 










S5 










90 


Va 1 




y 


Phe 


Leu 


Phe 


Gly 


Va 1 


Leu 


Phe 


Aia 


Leu 


Cys 


Phe 


Ser 








95 










IGC 










105 


^ y ^ 


Leu. 


Leu 


Ala 


His 


Ala 


Ser 


As n 


Leu 


Val 


Lys 


Leu 


Val 


Arg 


Gly 








110 










115 










120 


^- y ^ 


Vai 


Ser 


Phe 


Ser 




Thr 


Thr 


I le 


Leu 


Cvs 


He 


Ala 


He 


Gly 








125 








13 0 










135 


y ^ 


Ser 


Leu 


Leu 


Gin 


He 


I le 


He 


Ala 


Thr 


Glu 


Tvr 


Val 


Thr 


Leu 








14 0 










145 










150 


lie 


Me t 


Thr 


Arg 


Gly 


Met 


Met 


Phe 


Val 


Asn 


Met 


Thr 


Pro 


Cys 


Gin 








155 










160 










165 


Lj e'>^ 


As n 


Va 1 


Asp 


Phe 


Val 


Val 


Leu 


Leu 


Val 


Tyr 


Val 


Leu 


Phe 


Leu 








17 0 










175 










180 




A 1 a 


Leu 


Thr 


Phe 


Phe 


Va 1 


Ser 


Lys 


Ala 


Thr 


Phe 


Cys 


Gly 


Pro 










185 










190 










195 




Glu 


As n 




Lys 


Gin 


His 


Glv 


Arg 


Leu 


He 


Phe 


He 


Thr 


Val 








2 0 0 










205 










210 




Phe 


Ser 


lie 


He 


He 


Trn 
i J- ^ 


Va 1 


Val 


TrD 


He 


Ser 


Met 


Leu 


Leu 










215 










220 










225 




G 1 y 


As n 


Pr o 


Gin 


Phe 


Gin 


Arg 


Gin 


Pro 


Gin 


Trp 


Asp 


Asp 


Pro 








2 3 0 










235 










240 


V d i. 


V d J. 


(wys 


lie 


Ala 


Leu 


Va 1 


Thr 


As n 


Ala 


TrD 


Val 


Phe 


Leu 


Leu 








2 4 5 










250 










255 


T 1 


T"/r 




Val 


Pro 


Glu 


Leu 


V, y 


He 


Leu 


Tvr 


Arg 


Ser 


Cvs 


Arg 








2 60 










265 










270 


i r 1 


Vjj i u 


Cys 


X v_; 




Gin 


Gly 


As n 


Ala 


Cvs 


Pro 


Val 


Thr 


Ala 


Tvr 








27 5 










280 










285 


J. 1 1 




o cii. 




Gin 


Val 


Glu 


As n 


Gin 


Glu 


Leu 


Ser 


Arg 


Asp 


Lvs 










2 9 0 










29 5 










3 00 


i rp 


Lys 


Val 


Leu 


Leu 


As n 


Ser 


Asp 


Phe 


Leu 


Ser 


His 


Ser 


Gly 


Ala 






305 










310 










315 


Ala 


Arg 


Asp 


Ser 


Asp 


Gly 


Ala 


Glu 


Glu 


Asp 


Val 


Ala 


Leu 


Thr 


Ser 






320 










325 










330 


Tyr 


-ly 


Thr 


Pro 


He 


Gin 


Pro 


Gin 


Thr 


Val 


Asp 


Pro 


Thr 


Gin 


Glu 






335 










340 










345 


Cys 


Phe 


He 


Pro 


Gin 


Ala 


Lys 


Leu 


ser 


Pro 


Gin 


Gin 


Asp 


Ala 


Gly 








350 










355 










360 



Gly Val 

<210> 15 

<211> 370 

< 2 1 2 > PRT 

< 2 1 3 > Homo sap i ens 

<220> 

<22 1> misc_f eature 

<223> Incyte ID No: 7472082CD1 

<400> 15 



Met 
1 


Cys 


Lys 


Cys 


Phe 
5 


Arg 


Ser 


Gly 


Asn 


Ser 

10 


Thr 


Pro 


Val 


Leu 


cys 
15 


His 


Arg 


Asn 


Ser 


Glu 
20 


Ala 


Trp 


Gin 


Pro 


Arg 

25 


Lys 


Ala 


Pro 


Arg 


Thr 

30 


Gin 


Gin 


Thr 


Asp 


Met 

35 


Gly 


Tyr 


Thr 


Asn 


Leu 
40 


Asn 


Ser 


Lys 


Lys 


Glu 
45 


Cys 


Met 


Tyr 


He 


Lys 

50 


Glu 


Asn 


Phe 


Lys 


Lys 
55 


Thr 


Val 


Asp 


Lys 


He 

60 


Val 


Asp 


Pro 


Gly 


Asn 
65 


His 


Ser 


Ser 


Val 


Thr 

70 


Glu 


Ser 


He 


Leu 


Ala 

75 


Gly 


Leu 


Ser 


Glu 


Gin 
80 


Pro 


Glu 


Leu 


Gin 


Leu 
85 


Arg 


Leu 


Phe 


Leu 


Leu 
90 
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-'-^ Val Gly Asn Leu Gly Met 
100 

. s -hr Pro Met T:,'r 

12 0 



Phe 


Lsu 




I ie 


Cys Val ^ 
95 


lie 


1 nr 


L eu 


lie 


Gly Leu 
110 


Ty r 




Leu 


Ser 


Ser Leu 
125 


Va. 1 


T 1 P> 


Thr 


Pro 


Lys Met 
140 


lie 


i J- *= 


Ser 


Tyr 


pro Glu 
155 


lie 


ir lit: 


Ala 


He 


Ala Glu 
170 


Asp 


Cys 


Tyr 


Val 


Ala He 
185 


Met 


Ser 


Tyr 


His 


His cys 

200 


Leu 


Gly 


I le 


Leu 


Gly ser 
215 



115 

Asp Phe cys His Ser .ii^^ 

130 

Phe Ala Thr Glu Lys Asn 
145 

Gin Leu Tyr Leu Phe ser 

160 

Leu Ala Ala Met Ala T/r 

175 

^ T ^-^ Tyr Asn Val He 

195 



190 

Thr Val Gly Val lyr xx^ 

205 

Thr ser Phe Met Leu Arg 

220 

Asn His Tyr Phe Cys Asp 
235 



^ c^r- q^r Thr Tyr He Asn 
oeu Phe Pro Leu Leu Gly Leu Ser Cys Ser Ser 

11'. Va^ -u ser Ala Phe Asn He Leu Met Pro 
Glu Leu Leu Val Le^ va. ---^ ./J 
260 ^. ov,^ Tie Ala Ser He 

.Xa Leu Thr He Leu Ala Ser Tyr He Phe He 

275 ^ r,,- Ala Phe Ser Thr 

I^eu Arg He H.s Ser Thr Glu Gly Arg Ser Lys 

290 ^ ^. r>y.r^ Phe Gly ser Ala 

cys ser Ser His He Leu Ala Val Ala Val Phe 

305 . cer Ser Met Asp Gin 

Ala Phe Met Tyr Leu Gin Pro Ser Ser Val Ser 

3 20 ^ rx^h^ T^r He Val Pro Met Leu 

Arg Lys Val Ser Ser Val Phe Tyr Thr Thr 

T^^^ T^7c; Asn Val Lys Leu Ala 
Asn pro Leu He Tyr Ser Leu Arg Asn Lys Asp V y 3^^ 

val Lys Lys He Leu His Gin Thr Ala Cys 

3 6 5 



<210> 16 
<211> 319 
<212> PRT 

<213> Homo sapiens 
<220> 



<221> Tnisc_feature ^^^^.^^^.p. 
<223> incyte ID No: 7472H8;.D1 

«v .=u 

val T.. ... ... =ln Oly Phe Ty. «« A=P -f^ 

2° , i.u Leu He Tyr Al» TKr 

Gin Glu Leu Leu PM Gly AU He Leu Leu Y 

Va. val Cly Oly Met Met .la Leu He Ph. Thr ..p Ser 

Leu Gin Ser P^ro Me. Tyr Phe Phe Leu .su val Leu ser Ph^ 

65 , ry^j. pro Lys Leu Leu Val 

Leu ASP He Cys Tyr Ser Ser Val Val Thr 

Phe Leu Val sl Asp Lys Ser He Ser Phe Glu Oly Cys Val 
val Gin Leu Ala Phe Val Val H.s Val Thr Ala Glu Ser P e 

... Leu Ala Ser 1.11 Ala Tyr Asp Arg Phe Leu Ala He Cys Oln 
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Pro 


Leu 


H i s 


Tyr 


Gly 


Ser 


He 


Met 




Arg 


Gly 


Thr 


cys 


Leu 


Gin 








140 










145 










150 


Lgu 


Val 


Ala 


Val 


Ser 


Tyr 


Ala 


Phe 


Gly 


Gly 


Ala 


Asn 


Ser 


Ala 


He 










15 5 










160 










165 


3 1 n 


Thr 


Gly 


Asn 


V'a 1 

17C 


Phe 


Ala 


Leu 


Pro 


Phe 
17 5 


Cys 


Gly 


Pro 


Asn 


Gin 

ISO 


Leu 


Thr 


His 


Tyr 


Tyr 


Cys 


Asp 


lie 


Pro 


Pro 


Leu 


Leu 


His 


Leu 


Ala 










185 










190 










19 5 


Cys 


Ala 


Asn 


Thr 


Ala 


Thr 


Ala 


Arg 


Val 


Val 


Leu 


Tyr 


Val 


Phe 


Ser 








2 00 










205 










210 


Ala 


Leu 


Val 


Thr 


Leu 


Leu 


Pro 


Ala 


Ala 


Val 


He 


Leu 


Thr 


Ser 


Tyr 










215 










220 










225 


Cys 


Leu 


Val 


Leu 


Val 


Ala 


He 


Gly 


Arg 


Met 


Arg 


Ser 


Val 


Ala 


Gly 








230 










235 










240 


Arg 


Glu 


Lys 


Asp 


Leu 


Ser 


Thr 


cys 


Ala 


Ser 


His 


Phe 


Leu 


Ala 


He 




245 










250 










255 


Ala 


He 


Phe 


Tyr 


Gly 


Thr 


Val 


Val 


Phe 


Thr 


Tyr 


Val 


Gin 


Pro 


His 








260 










2 65 










27 0 


G 1 y 


Ser 


Thr 


Asn 


Asn 


Thr 


Asn 


Gly 


Gin 


Val 


Val 


Ser 


Val 


Phe 


T"/r 










275 










280 










285 


Thr 


He 


He 


He 


Pro 


Met 


Leu 


Asn 


Pro 


Phe 


He 


Tyr 


Ser 


Leu 


Arg 










290 










295 










300 


Asn 


Lys 


Glu 


Val 


Lys 


Gly 


Ala 


Leu 


Gin 


Arg 


Lys 


Leu 


Gin 


Val 


Asn 








305 










310 










315 


He 


Phe 


Pro 


Gly 

























<210> 17 

<211> 312 

<212> PRT 

<213> Homo sapiens 



<220> 

<22 1> misc_f eature 

<223> Incyte ID No: 7472134CD1 

<400> 17 



Met 


Asp 


Thr 


Gly 


Asn 


Trp 


Ser 


Gin 


Val 


Ala 


Glu 


Phe 


He 


He 


Leu 


1 




5 










10 










15 


Gly 


Phe 


Pro 


His 


Leu 


Gin 


Gly 


Val 


Gin 


He 


Tyr 


Leu 


Phe 


Leu 


Leu 








20 








25 










30 


Leu 


Leu 


Leu 


He 


Tyr 
35 


Leu 


Met 


Thr 


Val 


Leu 
40 


Gly 


Asn 


Leu 


Leu 


He 
45 


Phe 


Leu 


Val 


Val 


Cys 
50 


Leu 


Asp 


Ser 


Arg 


Leu 
55 


His 


Thr 


Pro 


Met 


Tyr 

60 


His 


Phe 


Val 


Ser 


He 
65 


Leu 


Ser 


Phe 


Ser 


Glu 

70 


Leu 


Gly 


Tyr 


Thr 


Ala 

75 


Ala 


Thr 


He 


Pro 


Lys 
80 


Met 


Leu 


Ala 


Asn 


Leu 

85 


Leu 


Ser 


Glu 


Lys 


Lys 
90 


Thr 


He 


Ser 


Phe 


Ser 
95 


Gly 


Cys 


Leu 


Leu 


Gin 

100 


He 


Tyr 


Phe 


Phe 


His 
105 


Ser 


Leu 


Gly 


Ala 


Thr 


Glu 


Cys 


Tyr 


Leu 


Leu 


Thr 


Ala 


Met 


Ala 


Tyr 








110 








115 










120 


Asp 


Arg 


Tyr 


Leu 


Ala 

125 


He 


Cys 


Arg 


Pro 


Leu 

130 


His 


Tyr 


Pro 


Thr 


Leu 

135 


Met 


Thr 


Pro 


Thr 


Leu 
140 


Cys 


Ala 


Glu 


He 


Ala 

145 


He 


Gly 


Cys 


Trp 


Leu 
150 


Gly 


Gly 


Leu 


Ala 


Gly 


Pro 


Val 


Val 


Glu 


He 


Ser 


Leu 


He 


Ser 


Arg 






155 










160 










165 


Leu 


Pro 


Phe 


Cys 


Gly 


Pro 


Asn 


Arg 


He 


Gin 


His 


Val 


Phe 


Cys 


Asp 








170 










175 










ISO 


Phe 


Pro 


Pro 


Val 


Leu 
185 


Ser 


Leu 


Ala 


Cys 


Thr 

190 


Asp 


Thr 


Ser 


He 


Asn 
195 


Val 


Leu 


Val 


Asp 


Phe 


Val 


I le 


Asn 


Ser 


Cys 


Lys 


He 


Leu 


Ala 


Thr 








200 










205 










210 


Phe 


Leu 


Leu 


He 


Leu 


Cys 


Ser 


Tyr 


Val 


Gin 


He 


He 


Cys 


Thr 


Val 










215 








220 










2 2 5 



17/31 



3NSDOC10 <WC :*570e5A2 i > 



PCT/lS*n/034 

w o 01/57085 

^-'^ "° ^ 

C,. S., ... ill ™» va. U, X.. a.. 

. „ TV. Ill n„ S„ ..u «P 

260 ^ f,^. . mv.^ pr-o ?he Leu 

T^/r Ser Va^ Leu *nr ri. 
ASP Gin Ala Leu A.a Val Vai i/r 285 

27t) ^ ^ p-,^ lie Lys Glu Ala 

Asn pro Phe He Tyr Ser Leu Arg Asn Lys Glu 

Val Ar. Arc Cln L^u Lys Arg IX. Leu Ma 



<210> 18 

<211> 321 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 7472136CD1 



;;i?°Li'..n T« ... A» S., Se. O.V XH. V.. ... Ma 

s.r =ly =« -1 Hi. T« Tvr .,u v.. ..u 
20 " „i„ Met ala Gly Asn 

ser S.r L.U M. Het Ph. Tbr Cy. L.u Cy, ^ 

„e. v.. n= T^rl -V P.. «| To 

P« cy. ..e .y. 

^ 6S ^ _ pin Thr Gin Pro Leu 

^ q^r Thr Leu Ser Leu Giu inr 

Leu Phe ser Met Ala Ser inr u yu 

„ai ..n ™. 4 v.. Hi. Ci. .y- - 

.y. P.. .i. xy. vai =iy ... s„ ... ... ™. .ia n. . 

L .i„ cy. s.. vai ... P.. »o - - g 

125 rr. ^r^i r\/^ Glv Leu Leu T. rp 

His Arg pro Arg His Leu Ser Ala Trp Va Cys Gly 

l-iO . T^„ Thr ser Ser Phe Cys Ser 

Thr Leu cys Leu Leu Met Asn Gly Leu Thr 
Thr Leu y 160 

Lys Phe Leu Lys Phe Asn Glu Asp Arg Cys Phe 

I'^O , w 1 '^n Thr Pro Val Met Thr 

val Gin Ala Ala Leu lie Met Gly Val ueu 

185 T j^^rr Ar-rr Se^" Ser Gin 

X.eu ser Ser Leu Thr Leu Phe Val Trp Val Arg Arg Se. 
Leu ber ^jb ^j^^ 

ai„ Trp "° ITo " = 

Se. vai ... vai n, Cy. Se, ... Pro ... ^ 

P.. vai ... T.P ... 

J P., ... f. S.. A„ ... ^« vai S.. S.. 

. eor Arg Arg Ser His 
^7^1 Tie Tvr Phe Leu Val Gly ber ^-^y ^ ^ 
Asn Pro Vai iit^ ^ ^^-^ 

p.. T... s.. ... Ciy T.. V ... 

::: =i. =i. ^.o b "° 



305 

Thr Asn Glu Met Gly Ala 
320 
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<21C> 19 
<2 11> 316 
<2 12> PRT 

<2i3> Homo sapiens 

<221> misc_f eature 

<:225 > Incyte ID No: 7472142CD1 

<400> 19 



Met 


Gin 


Gly 


Glu 


Asn 


Phe 


Thr 


He 


Trp 


Ser 


He 


Phe 


Phe 


Leu 


Glu 








5 










10 










15 


G 1 V 


PhG 


Ser 


Gin 


Tyr 


Pro 


Gly 


Leu 


Glu 


Val 


Val 


Leu 


Phe 


Val 


Phe 








20 










25 










30 


Ser 


Leu 


Val 


Met 


T"yr 


Leu 


Thr 


Thr 


Leu 


Leu 


Gly 


Asn 


Ser 


Thr 


Leu 










35 










40 










45 


lie 


Leu 


lie 


Thr 


He 


Leu 


Asp 


Ser 


Arg 


Leu 


Lys 


Thr 


Pro 


Met 


Tyr 










50 










55 










60 


Leu. 


?he 


L eu 


Glv 


Asn 


Leu 


Ser 


Phe 


Met 


Asp 


He 


Cys 


Tyr 


Thr 


Ser 








65 










70 










75 


Ala 


Ser 


Val 


Pro 


Thr 


Leu 


Leu 


Val 


Asn 


Leu 


Leu 


Ser 


Ser 


Gin 


Lys 










80 










85 










90 


Thr 


lie 


I le 


Phe 


Ser 


Glv 


Cvs 


Ala 


Val 


Gin 


Met 


Tyr 


Leu 


Ser 


Leu 










95 










100 










105 


Ala 


Met 


G ly 


Ser 


Thr 


Glu 


Cvs 


Val 


Leu 


Leu 


Ala 


Val 


Met 


Ala 


Tvr 










110 










115 










120 


A c; n 


Arg 


X y i- 


Val 


Ala 


I le 


Cys 


Asn 


Pro 


Leu 


Arg 


Tyr 


Ser 


He 


He 










125 










130 










135 


Me t 


As n 


Ar g 




Val 


Cvs 

\- y s> 


Ala 


Arg 


Met 


Ala 


Thr 


Val 


Ser 


Trp 


Val 






140 










145 










150 


Thr 


Glv 


^ y ^ 


Leu 


Thr 


Ala 


Leu 


Leu 


Glu 


Thr 


Ser 


Phe 


Ala 


Leu 


Gin 






155 










160 










165 


lie 


Pro 


Leu 


Cvs 


Glv 


Asn 


Leu 


He 


Asp 


His 


Phe 


Thr 


Cys 


Glu 


He 








170 










175 










180 


Leu 


Ala 


Val 


Leu 


Lys 


Leu 


Ala 


Cys 


Thr 


Ser 


Ser 


Leu 


Leu 


Met 


Asn 










185 










190 










195 


Thr 


He 


Met 


Leu 


Val 


Val 


Ser 


He 


Leu 


Leu 


Leu 


Pro 


He 


Pro 


Met 










200 










205 










210 


Leu 


Leu 


Val 


Cys 


He 


Ser 


Tyr 


He 


Phe 


He 


Leu 


Ser 


Thr 


He 


Leu 








215 








220 










225 


Arg 


He 


Thr 


Ser 


Ala 


Glu 


Gly 


Arg 


Asn 


Lys 


Ala 


Phe 


Ser 


Thr 


Cys 








230 










235 










240 


Giy 


Ala 


His 


Leu 


Thr 


Val 


Val 


He 


Leu 


Tyr 


Tyr 


Gly 


Ala 


Ala 


Leu 








245 










250 










255 


Ser 


Met 


Tyr 


Leu 


Lys 


Pro 


Ser 


Ser 


Ser 


Asn 


Ala 


Gin 


Lys 


He 


Asp 










260 










265 










270 


Lys 


He 


lie 


Ser 


Leu 


Leu 


Tyr 


Gly 


Val 


Leu 


Thr 


Pro 


Met 


Leu 


Asn 










275 










280 










285 


Pro 


He 


He 


Tyr 


Ser 


Leu 


Arg 


Asn 


Lys 


Glu 


Val 


Lys 


Asp 


Ala 


Met 








290 










295 










300 


Lys 


Lys 


Leu 


Leu 


Gly 


Lys 


He 


Thr 


Leu 


His 


Gin 


Thr 


His 


Glu 


His 



305 310 315 

Leu 



<210> 20 
<211> 325 
<212> PRT 

<2 13 > Homo sapiens 
< 2 2 0 > 

<2 2 1 > misc_f eature 

<223> Incyte ID No: 7472171CD1 

<4 00> 20 

Met Glu Pro Leu Asn Arg Thr Glu Val Ser Glu Phe Phe Leu Lys 
15 10 15 
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PCT/ISOU03455 

W O 01/57(^5 

c-.y P.e S.r OVy Tyr Pro Leu <=lu His L,u L,eu Pro .ju 

cy, s.r Ala T^S Leu V.l Thr L.u lJu =ly A.n TUr Ma n. 

vet Ma val Ser Leu A.p He HU L.u „.s Pro Val Tyr 

P.e Ph, L.U Oly Leu Ser T« Leu Asp He Cys Tyr T.r Pr> 

Ph. val pro L^u „.t L.u val Hr, L.u Leu S.r S.r Arg Lys 
-,r lie S.r Phe Ma Val Cys Ma He =ln «et Cys L.u S.r Leu 
ser Thr =ly s.r TOr Glu cys Leu Leu Leu M. n. Thr Ala T^r 
ASP Ar, Tyr L.u 111 11. cys om Pro L.u Ar, Tyr „is val Leu 
Met ser „rs Ar, HI Cys Val Leu Leu M.r Oly Ma Ala Trp Val 
L.U cys Leu Leu ^Js ser v.l T.r clu ... Val II. s.r H.c Ar| 
L,u pro Phe cys "ly Hrs His Val Val Ser His Ph. Thr Cys Ly. 
ne Leu Al. Val "eu Lys Leu Ala Cys Gly Asn Thr S.r Val Ser 
=lu ASP Ph. L.u "u Al. =ly S.r lie L.u L.u Leu Pro val Pro 
L.u Ala Phe ne Leu S.r TVr Leu L.u 11. Leu Ala Thr lie 

L.U Ara V.l pro ^ir Ala Ala Ar, Cys Cys Lys Al. Ph. S.r Thr 
cys L.U Al, His VI Al. V.l V.l L.U Leu Ph. Tyr Gly Thr 11. 
„e Phe Her Tyr 'dl Lys Pro Lys s.r Lys =lu Al, His II. S.r 
,,p Glu v.l Phe lh°r V.1 L.u Tvr Al. Me. V,l Thr Thr Met Leu 
„„ pro Thr II. Sr ser Leu Ar, Ash Lys Glu v.l Lys Glu Al, 
Al, Ar, Lys v.1 oly Ar, ser Ar, Al, s.r ser Glu oly Gly 

Ar, Gly S.r v.l Glh Thr Glh V.l Ser Gly 

<210> 21 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<220> 

<'^'^1> misc feature 

<223> Tncyte ID No: 7472172CD1 

<'1'^0> 21 , Gin He Thr Leu He Ala Phe 

Me- Gly Asn Trp Ser Thr Val Thr Glu lie 

.4 Ma Leu Leu Clu He Ar. He Ser Leu Phe Val Val Leu Val 
val Thr Tyr Thr L^u Thr Ala Thr Gly Asn He Thr He He Ser 

r r-ir. THr Pro Met Tyr Phe Phe 
Leu He Trp He Asp His Arg Leu Gin Thr Pro Me y 

.eu ser Asn Leu sli Phe Leu Asp He Leu Tyr Thr Thr Val He 

Thr pro Lys Leu Leu Ala Cys Leu Leu Gly Glu Glu Lys Thr He 

ser Phe Ala Gly cS Met He Gin Thr Tyr Phe Tyr Phe Phe Leu 
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PCT/LS01/*M455 



Gly 


Thr 


Val 


Glu 


Phe 


He 


Leu 


Leu 


Ala 


Val 


Met 


Ser 


Phe 


Asp 


Arg 








110 










H5 










12 0 


Tyr 


Met 


Ala 


He 


Cys 
125 


Asp 


Pro 


Leu 


His 


Tyr 

130 


Thr 


Val 


He 


Met 


Asn 

135 


Ser 


Arg 


Ala 


cys 


Leu 
140 


Leu 


Leu 


Va 1 


Leu 


Gly 
145 


Cys 


Trp 


Val 


Gly 


Ala 

150 


Phe 


Lea 


Ser 


Val 


Leu 
155 


Phe 


Pro 


Thr 


I le 


Val 
160 


Val 


Thr 


Arg 


Leu 


Pro 
165 


T^/r 


Cys 


Arg 


Lys 


Glu 
170 


He 


Asn 


His 


Phe 


Phe 

175 


Cys 


Asp 


He 


Ala 


Pro 
180 


Leu 


Leu 


Gin 


Val 


Ala 
185 


Cys 


He 


Asn 


Thr 


His 
190 


Leu 


He 


Glu 


Lys 


He 
195 


Asn 


Phe 


Leu 


Leu 


Ser 

200 


Ala 


Leu 


Val 


He 


Leu 

205 


Ser 


Ser 


Leu 


Ala 


Phe 
210 


Thr 


Thr 


Gly 


Ser 


Tyr 

215 


Val 


Tyr 


He 


He 


Ser 
220 


Thr 


He 


Leu 


Arg 


He 
225 


Pro 


Ser 


Thr 


Gin 


Gly 
230 


Arg 


Gin 


Lys 


Ala 


Phe 

235 


Ser 


Thr 


Cys 


Ala 


Ser 
240 


His 


He 


Thr 


Val 


Val 
245 


Ser 


He 


Ala 


His 


Gly 
250 


Ser 


Asn 


He 


Phe 


Val 
255 




Val 


Arg 


Pro 


Asn 
260 


Gin 


Asn 


Ser 


Ser 


Leu 
265 


Asp 


Tyr 


Asp 


Lys 


Val 
270 


Ala 


Ala 


Val 


Leu 


He 
275 


Thr 


Val 


Val 


Thr 


Pro 
280 


Leu 


Leu 


Asn 


Pro 


Phe 
285 


He 


Tyr 


Ser 


Leu 


Arg 

290 


Asn 


Glu 


Lys 


Val 


Gin 

295 


Glu 


Val 


Leu 


Arg 


Glu 
300 


Thr 


Val 


Asn 


Arg 


He 

305 


Met 


Thr 


Leu 


He 


Gin 
310 


Arg 


Lys 


Thr 







<210> 22 
<211> 1413 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7472033CB1 



<220> 

<221> unsure 

<222> 176, 383, 927 



^3> a, t. 



or other 



<400> 22 
atgaatcaga 
agcagcaaca 
acggtgcacg 
aggc cgc tgg 
tggt accaca 
accc ttggt t 
ttcatcc tea 
tcagacc tac 
tac tggccc t 
tgtattt teg 
gtgtacccca 
tgggcac tgg 
gacacgccgg 
gactggccaa 
ctgg tgccca 
cgcatcaccg 
accaac tgcc 
t tt t tcaacc 
tacgceate t 
tggc tcaacg 
aatg tggc t c 
cgeaagtatg 
agagcggtac 



cggagcccgc 
gcgtgcgc ta 
ccc tcaacac 
acccggtgc t 
tgc tcatcag 
gt tatagccc 
acc tggcca t 
t t ttatgcct 
acggctcc tg 
tgtcgacaa t 
cgcgggacag 
eac tgctgc t 
cac t cc tgca 
gtcgcaacgg 
tec tgatcgt 
tggtggc tgt 
tac tgatcag 
tgtacgcgga 
gccacatgat 
acaac ttccg 
ttaacggtca 
ggcgccgaat 
egcggcgggg 



ccagctggca 
tctggacgac 
cac tgccatc 
tatcgatagg 
catgtacggc 
gtcatccgga 
atncggccaa 
agtcaccatg 
c tccatcc tg 
atccataacg 
cc tgcagt tc 
ggcctcgccg 
gcagatcggc 
gcgct tctac 
ctcggtggca 
gcaggcncgg 
catcgccatc 
catggagcgc 
cggca tgagc 
taaagaaatt 
cacgacaggc 
tc t ccaaagg 
gaggcggtct 



gatggggagc 
cggcatccgc 
aacacc tcgg 
t tcctgagca 
gtgctaatcg 
agcccatcat 
agcaagtgtg 
ccgctgacct 
tgcaaaacga 
gccattgcc t 
gtgggcgcgg 
ctgttegtc t 
ctgcaggaca 
tactcgatct 
tae t tcggga 
aaggtggagc 
ate tt tggcg 
tcgecggtca 
tccgcc tgct 
caagaac tgc 
tgcaacgtcc 
egaactcaag 
ggcggccacc 



ate tgagtgg 
tggac tacct 
atctgaatga 
acagggcggt 
t cttcggcgc 
gcgcac tgct 
agtctcatcc 
tgatggagat 
t tgccatgc t 
t cgacagata 
tgacgatcc t 
acaaggagc t 
cgatcccgta 
tctcgc tgtg 
t c tacaacaa 
gggggcggcg 
tttcttggct 
c tcagagcat 
ccaacccgt t 
tctgccgttg 
aggcggcggc 
c tgc tggggc 
gac t tcatga 



atacgccagc 
tgacctgggc 
gactgngagc 
ggacagcccc 
cc taggcaac 
cgcaatctgg 
gagcggact t 
cc tgtccaag 
gcaggcactt 
tcaggtgatc 
ggcggggatc 
gatcaacaca 
c tgcat tgag 
cgtacaatac 
gctgaagagc 
gatgaagcgc 
gecge t tgac 
gc tagtccgc 
gc tctacggc 
c tcagacact 
gcgcaggcgt 
aggcggcgcc 
ccggcaccac 



60 

120 

180 

240 

300 

360 

420 

4S0 

540 

600 

660 

720 

780 

840 

9 00 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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W O 01/57085 

1413 

gaggggggac tcgccaacat agttcatcat tga 

<210> 23 
<211> 1075 

< 2 1 2 > DNA 

<213> Homo sapiens 

< 2 2 0 > 

<221> misc_feacure 

<223> Incyte ID No: 7472034CB1 

<4C0> 23 ^^^^ Aftrraataa tattacaaaa caaggttctt 60 

tctacccaaa tgagaaggaa ^agagaaaca attctggtaa ^^^^^^^^^^ gatgaaatct 120 
taacacctcc gaaggatcaa ^ccagctcac ^^^^^^ gctgatcaaa gaggcaccag 180 

ctqaattacc agaaaaagaa ttcagaagat ?gcactgccc ctgggcatcc 240 

aaaaagggat tccaggttta ^aggaaagcc ^^^^^^^^^^ catcat?tgg atggacccat 300 
tttacctcct tgctttagtg ^^^^^^9"^ ccatgctagc tgccatcgac ctggttctgg 360 
ccttgcacca atctatgtac ctcttcctgt ^^^^^^ ^ tcatgcccac gagattgggt 420 
ccccc.ccac tgcacccaaa ^cccttgcag tgctcctggt ^ca^g^^^^^ gagtcagggg 480 
acatcgtcrg cctgatccag ^tgttcttca ^"^^^^^ tcaccccttg caccattcca 540 
tacttgtggc catggctctg ^atcgctatg [^flll ^ ggtgccggtg aggggattac 600 
caatcctgca tccaggggtc ^^f^?^^^^^ gaa^actlat cttctgccaa gccaccacca 660 
^_,^^,^tt-at ccccttcccc attttgtcgg gaciL,ci t-rrr-r r ctctca aaaaccacag 720 

r^agg^catgc ctattgtgaa catatggctg tcgtgaaact t^-tgct^a 9^^^^^^^^, ,30 
ccaltcgagc ttatgggctg actatggcct ^gcttgtgat ggs^^g agtgaggccc 840 
-tggtgtttc ctatgcccac atcctccagg catcctggto ttccatgtcc 900 

aacttaaggc gtctagcaca tgtggctctc atatttgtgc catgtccatg 960 

^tggaat??t ctcctccccc actcaccgct ttggtcatca tgtac^^^^^ gtctatggag 1020 
ttcttctggc cacacggtat =^^^^^^^9^ tcagagtgtt tacacaaaag gattga 1076 
tcaaaactca gcagatccgc cagcgagtgc tcagag y 



tgaagactca gcagatccgc cagcgagtgc 

<210> 24 
<211> 948 
<212> DNA 

<213> Homo sapiens 
<220> 

<^21> misc feature 

<223> incyte ID No: 7472035CB1 




ggggtta^gg ctttgtacta tctacctggg ^tggcatcca ----- gat^atScca 840 
?!?^atgtag tgcccttgca cacccaagtc |tgctagc^9 Lct^cggga gagaatatgg 
I^tfat^tga ^gc-g c "^^cca? t??aacctgg gttcatga 



<::io> 2 5 

<211> 945 

<212> DNA 

<213> Homo sapiens 

<22C> 

<'''21> misc_feature 

<223> Incyte ID No: 7472036CB1 



22/31 



BNSDOCID <WQ . 0157De5A2 



wo 01/5"'<»85 



PCT/tS01/t»i455 



< 4 C ■ J > 2d 
atgt cagcc t 
ggrztggaac 
cngrttggaa 
at gcacctc t 
cczaaaatgc 
gc r ragatgt 
gc r r t cgacc 
icrrtcatca 
c t r c c c 1 1 c c 
tgtgaacaca 
ggcatcgctg 
ate t ttattc 
gggacatgtg 
tcagtcatgc 
t tc tatctgc 
atccgtgaga 



ccaatatcac 
acc t gcaca t 
actgcactc t 
ttc tggccat 
ttgccatatt 
tctt cct tea 
getatgtggc 
eeaagat tgg 
tgetgagatg 
tggctgtggt 
tggecatgtt 
t teaggcagt 
tc te tcatat 
acegtgtagc 
tct tcccacc 
gcat ct tggg 



c t taacacat 
etggatetcc 
ccttcteatc 
gttggcagcc 
-tggtteagg 
tec ttc tec 
tatc tgcaag 
eatgge tgct 
t ttccactac 
gaggctggcg 
tattgtgg tg 
tctactgc tt 
aggtgccatc 
ccgccatgct 
catggtcaat 
agtattccca 



ccaac tge c t 
atceetttct 
atceaggctg 
atcgacctgg 
gategggaga 
at catggagt 
ccactgeact 
gtggcecggg 
tgccgaggcc 
tgtggggaca 
t tggacc tgc 
gcctc tcagg 
t tagcc t tc t 
gcccc tcatg 
cccataatc t 
agaaaggata 



t c t tgt tggt 
ge t tagca ta 
a tgeagecc t 
t cct t tee tc 
taaac 1 1 3 1 1 
cagcag tgct 
acaccaaggt 
c tgtgacact 
cagtgatcgc 
e tagct tcaa 
tccttgttat 
aggcecgc ta 
acacaaetgt 
tccacatect 
atggtgteaa 
tgtag 



gggga t tcca 
tacac tggcc 
c eatgaacce 
c tcageactg 
tgectgtctg 
gctggccatg 
cctgae tggg 
aatgac tcca 
t cac tgc tac 
caa tat c tat 
ec tg tc t tat 
eaaggcattt 
ggtcatcte t 
cc t tgccaat 
gaccaagcaa 



120 
130 
240 
3 00 
360 
420 
480 
S40 
600 
660 
720 
730 
840 
900 
945 



< 2 1 0 > J 6 
<211> 966 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> IncytG ID No: 



7472037CB1 



<400> 26 
atggc tcatc 
c tgc tggtgg 
c tcacctacc 
cgcactctcc 
tacacac tgg 
tccatcgcgg 
ttcc tec tta 
tc tg tgatca 
1 1 c teagtgg 
■ratgtggagc 
tcacteaagg 
t t cattc tec 
ggceggcgea 
ggttgtgcct 
ctggtggcgg 
aggaacaagg 
aagtaa 



aggcgcctga 
gat tctccaa 
tggtcacac t 
acacacc tat 
tcaccatccc 
getgtcgggc 
ccctgatggg 
tgagaeccae 
ee t tgattga 
ac t te ttc tg 
gaet tggcat 
tctcctacgc 
aagcc ttete 
ccatcatc ta 
tgttctaeac 
aggtgagggt 



gaagcagcag 
ectcccagaa 
eagtggcaat 
gtaccgc t tc 
caatatgc tg 
ccagatgt tc 
e tatgacagg 
cgtctgcctg 
gacctgcatg 
tgacat tgcg 
cttc t tec tg 
cttcattgtg 
taectgcgea 
eetgaggccg 
egtggtgaca 
agege tgagg 



gacaatggga 
ctgaggccca 
gceaccatea 
c tggcegtgc 
gc tcatc tgc 
ttettcc tag 
tatgtggeea 
tgtttgggag 
atcttc tec t 
ce tgtgc tga 
agcatcc teg 
gctgecattg 
geeeace tea 
gactetgggg 
ccgc tgc tga 
aaaaacc tgg 



cctggc tggt 
e tctcttcat 
tcaccatcat 
tc teec tctc 
tgatggagag 
get tgggt tg 
t c tgccatcc 
cettggtttt 
caecc t tc tg 
age tcagetg 
tggtgc tggt 
tgagga tccc 
cggtggtcat 
c taatccc tc 
accctgtggt 
cacggggctg 



gacagagttc 
c t tgt tec tc 
ccaggtggat 
tgagacc tge 
ceaggceatc 
cagccattgt 
e ttgcge tac 
c tge tc tggt 
tggcgcaggc 
tgatgagagc 

t tcggcc tc t 
cgtaca t tt t 
ecaggaecgc 
t tacaccctg 
tggagcat tt 



<210> 27 

<211> 996 

<212> DNA 

<2 13 > Homo sapiens 

<220> 

<221 > mi sc_ feature 

<223> Incyte ID No: 7472039CB1 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

966 



<4 00> 2 7 
atgagtcc tg 
aatc tcaaca 
ctgacaggca 
atgtacttet 
ccaaagatgc 
aeccaattct 
tc tgcggatc 
ge tgtgtgc t 
cceacagtgg 
ttetgcgaca 
actgactttg 



atgggaacca 
gcgeaagag t 
atgtgt tgat 
t tc tgggtaa 
tgageaat 1 1 
atttctactt 
getacc tggc 
t tcgtgtggc 
c tgtggee 1 1 
gtggcccae t 
tectggcctc 



cagtagtgat 
ggaat tat tt 
tgtgggggtg 
ce tgtcc tgc 
cctetcaagg 
ett tctcggg 
catc tgteat 
cttggcctgc 
get tec t ttc 
gc tecgce tg 
cctcgtcatt 



ceaacagagt 
tctgtgt ttc 
gtaagggctg 
ctagagatac 
caacaeae ta 
gectccgagt 
cc tc tgege t 
tgggtggggg 
tgtaageagg 
gc t tgeacea 
gtatcttcet 



tegtec tggc 
t te t tgtc ta 
atactcgac t 
tgetcac t te 
tttcctttgc 
tettactgtt 
aeccc t tgc t 
gaetcgtcce 
gtgctgtggt 
aeaecaagaa 
tgctgatcac 



agggc t ceea 
t e tec tgaat 
acagacccet 
tgtcatcatt 
tgcatgtatc 
ggc tgt catg 
eatgagtggg 
tgtgc t tggt 
acagcac t tc 
gctggaggag 
tge tgtgtee 



60 
120 
180 
240 

2 00 

3 60 
420 
480 
540 
600 
660 
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W O (»l/57(»85 

tacggcctca ttgtgctggc agtcctgagc accccccccg --aggccg -agaaggcc 720 
ttctctaccc gtacctccca cttgatagtg ^^^^^^^^ct ^9 y .^^^^gtaata 840 

ctctatgtgc ggccatcgca gagtggttct 9^99^=^=^^ cctflcgtaa tgagcaagtc 900 

acgacatrcg tgacaccact gttgaatcca " ^^^^ g gaacctttta 960 

aaaaaaqctt tgaaggacac gtttaggaag gcagt,,gcag gcgc.. ^ 
cttStaaat g^ctcagtga gaaagcagta aagtaa 



<i:0> 23 
<:111> 1014 

< 2 1 2 > DNA 

<2:3> Homo sapiens 

< 2 2 0 > 

<-->-'l> misc feature 

<223> Incyte ID No: 7472040CB1 



-cntatgccg ccatcttcct ggtgggggtg ==^99^""^^^ tccacctggc cgtggcggat 240 
r.'jaaggtgg cccgccggag ggtgggtgcc acctggttgc ^ccacc gg ^ ^ 300 
rt.'rtg?g?t gtttgtctct gcccatcctg Q^agtgccca ^tgcccg gg ^99 ^^^ 
--'-ra-ggrg caacgggctg tcgggcgctg ccct^cacca ^__„^y rgggcctgcc 420 

a;,-gt"--gc tcctggcagc tcccagtgcc gacctctg.t ^--99-^^" agcctggaca 480 
taatggtcL cggttcagcg 9gcgtgcggg 9tgcaggtgg cctgtgggg^ g^agclcttc 540 
ctggccttgc tgctcaccgt gccctccgcc acctaccgcc 99 y gaatgcggtg 600 

cclacccggc tgcagtgtgt ggtggactac 9gcggctcct 9 |^ cagctgccac 660 
actgccatcc ggcttctttt tggcttcctg g9gcccctgg 99 cattgtggtg 720 



<210> 29 

<211> 5122 

<212> DNA 

<213> Homo sapiens 

< .";20> 

<"^1> misc_feature 

<223> Incyte ID No: 4250893CB1 



-400> 29 ^ ^^r,r,^ ^r-taaaaacc cattccctgc tctgcggctc 60 

cagccagcag cccggggctc cactctgggt ^ctgaaagcc = gtccaggacc 120 

ctcccacccc acctcttctc agccttgcag ^ccaagggtt 9 cccatctccc 180 

caggagaggg aagaatctga gg|acacaga -agtgagcg ^^9 ^^^^^ atcccaggac 240 
gtcaccacat ctcccctctg taacaccctc ^= « tctgggctcc tccctcccct 300 

ctccctatca gctgacttct tccagtgtct ^9caggcccc ^ 9gy ggatttataa 360 

,.,cttttcct accactcccc ctctatcggc 9tctat tgt agg 9 9 ^^gcactgtt 420 
aactaggttc cgaatgctga ataagagacg g^aagagcca 99 cttaactccg 480 

-tctgcctgc ctgataccct caccacctgg 9aacatcccc ^ag ttcctgaaga 540 

,TgacIgagat ggctggcgga g^ctggggcc 9^=^9gcctg ^ta gg^y aggagctctt 600 
aggaggagct gaaggagttc cagcttctgc ^cgccaataa 9 y tcgtacctgg 660 

rg5g?gagac acccgctcag ccagagaaga ^9agtggcat 9gagg gg g^.gg 720 

rggctcagta tggggagcag cgggcctggg ^^=^^9ccct yy ^^^^^^^^^^ 780 

ggctgaggtc actgtgcgcc caagcccagg aaggggcagg ^ gcagtgctaa 840 

icagcclLg tgaaccccac ctggggtctc ccagccaacc -acc ^gggttttga 900 

tgccctggat ccatgaattg --g9cggggt g^^^^^^H^ ^tctgcctca cacgtctacc 960 
gacagctgcc tgacacatct gga<=g^^9Ct 9gay y ggagtcaccc aacgccccca 1020 

aagctcttcc aagctcccca gaccatgagt ^tccaagcca 9g g gcacccagag 1080 

calccacagc agtgctgggg agctggggat gtcaggaatt tactacacag 1140 

agcaggaggc tcctgggacc caatggcctc cggatgaaac gtcjg^^^^ gcagcggtgg 1200 
aaatcagaga aagagagaga gagaaatcag agaaagg y ^ tgggagcctt 12b0 

.aggaacgcc cccacaggcg cacaccagcc ^acagcccca = ^^^^^^^.^ aaccaaaaat 1320 
ctgtgagaga gagcctctgt tccacacggc yy 
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BNSDOCiD <WC ;ji57065A2 



wo 01/5^ns5 



PCT/LS*n/03455 



i racacagct: gctacf^cta caaagac c tc ac cccagaag c caagat ccc c tggccaaga 1 3 S 0 
gaagcnggcc tgat-angtg gaggagaatc gaggacactt aattgagatc agagacttat 144 3 
icggcccagg ccnggacacc caagaacctc gcatagtcat actgcagggg gctgctggaa 1500 
1 Z jggaagcc aacac tggcc aggcaggtga aggaagc ccg gqggagaggc cagctgtarg 1550 
gggaccgc::^ ccagcatgtc ttctacttca gctgcagaga gctggcccag ::ccaaggtgg 162C 
•:gsgtctrgc cgagctcatc ggaaaagatg ggacagccac tccggctccc a::tagac:aga 168C 
ccrtgtccag gccagagcgg ctgctctcca tcctcgatgg tgtagatgag ccaggatggg 1740 
':.c':t:gcagga gccgagttct gagctctgtc tgcactggag c cagccacag ccggcggatg 18 00 
racrgcnggg cagtctgccg gggaaaacta tacttccrga ggcatccttc ctgatcacgg 1860 
rtzggaccac agctctgcag aacctcattc cttcrtrgga gcaggcacgt tgggtagagg 1920 
ccctggggtr ctctgagtcc agcaggaagg aatantrctia cagatat ttc acagatgaaa 19 SO 
ggcaagcaac tagagccttc aggttggtca aatcaaacaa agagctctgg gccctgtgtc 2040 
ttgtgccctg ggtgtcctgg ctggcctgca cttgcctgat gragcagatg aagcggaagg 21C0 
aaaaactcac actgacttcc aagaccacca caaccctctg tccacattac cttgcccagg 1160 
rtrtccaagc tcagccattg ggaccccagc tcagagacct ctgctctctg gctgctgagg 1220 
gcatctggca aaaaaagacc cttttcagtc cagatgacct caggaagcat gggttagatg 2280 
gggccatcat ctccaccttc ttgaagatgg gtattcttca agagcacccc atccctctga 2340 
gctacagctt cattcacctc tgtttccaag agttctttgc agcaatgtcc tatgtctcgg 2400 
agga tgagaa ggggagaggt aaacat tc ta attgcatcat agatt tggaa aagacgc tag 2 4 60 
aagcatatgg aatacatggc ctgtttgggg catcaaccac acgtttccta ttgggcctgt 2520 
taagtgatga gggggagaga gagatggaga acatctttca ctgccggctg tctcagggga 2 580 
ggaacctgat gcagtgggtc ccgtccctgc agctgctgct gcagccacac tctctggagt 2640 
rcrtccactg cttgtatgag actcggaaca aaacgttcct gacacaagtg atggcccatt 2700 
tcgaagaaat gggcatgtgt gtagaaacag acatggagct cttagtgtgc actttctgca 2760 
ttaaattcag ccgccacgtg aagaagcttc agctgattga gggcaggcag cacagatcaa 2820 
catggagccc caccatggta gtcctgttca ggtgggtccc agtcacagat gcctattggc 2880 
agattctctt ctccgtcctc aaggtcacca gaaacctgaa ggagctggac ctaagtggaa 2940 
actcgctgag ccactctgca gtgaagagtc tttgtaagac cctgagacgc cctcgctgcc 3000 
tcctggagac cctgcggttg gctggctgtg gcctcacagc tgaggactgc aaggaccttg 3 060 
cctttgggct gagagccaac cagaccctga ccgagctgga cctgagcttc aatgtgctca 3120 
cggatgctgg agccaaacac ctttgccaga gactgagaca gccgagctgc aagctacagc 3180 
gactgcagct ggtcagctgt ggcctcacgt ctgactgctg ccaggacctg gcctctgtgc 3240 
ttagtgccag ccccagcctg aaggagctag acctgcagca gaacaacctg gatgacgttg 3300 
gcgtgcgact gctctgtgag gggctcaggc atcctgcctg caaactcata cgcctggggc 3360 
tggaccagac aactctgagt gatgagatga ggcaggaact gagggccctg gagcaggaga 3420 
•aacctcagct gctcatcttc agcagacgga aaccaagtgt gatgacccct actgagggcc 3480 
tggatacggg agagatgagt aatagcacat cctcactcaa gcggcagaga ctcggatcag 3540 
agagggcggc ttcccatgtt gctcaggcta atctcaaact cctggacgtg agcaagatct 3600 
tcccaattgc tgagattgca gaggaaagct ccccagaggt agtaccggtg gaactcttgt 3660' 
gcgtgccttc tcctgcctct caaggggacc tgcatacgaa gcctttgggg actgacgatg 3720 
acttctgggg ccccacgggg cctgtggcta ctgaggtagt tgacaaagaa aagaacttgt 3780 
accgagttca cttccctgta gctggctcct accgctggcc caacacgggt ctctgctttg 3840 
tgatgagaga agcggtgacc gttgagattg aattctgtgt gtgggaccag ttcctgggtg 3900 
agatcaaccc acagcacagc tggatggtgg cagggcctct gctggacatc aaggctgagc 3960 
ctggagctgt ggaagctgtg cacctccctc actttgtggc tctccaaggg ggccatgtgg 4020 
acacatccct gttccaaatg gcccacttta aagaggaggg gatgctcctg gagaagccag 4080 
ccagggtgga gctgcatcac atagttctgg aaaaccccag cttctccccc ttgggagtcc 4140 
tcctgaaaat gatccataat gccctgcgct tcattcccgt cacctctgtg gtgttgcttt 4200 
accaccgcgt ccatcctgag gaagtcacct tccacctcta cctgatccca agtgactgct 4260 
ccattcggaa ggccatagat gatctagaaa tgaaattcca gtttgtgcga atccacaagc 43 20 
cacccccgct gaccccactt tatatgggct gtcgttacac tgtgtctggg tctggttcag 4380 
ggatgctgga aatactcccc aaggaactgg agctctgcta tcgaagccct ggagaagacc 4440 
agctgttctc ggagtcctac gttggccact tgggatcagg gatcaggctg caagtgaaag 4500 
acaagaaaga tgagactctg gtgtgggagg ccttggtgaa accaggagat ctcatgcctg 45 60 
caactactct gatccctcca gcccgcatag ccgtaccttc acctctggat gccccgcagt 4620 
tgctgcactt tgtggaccag tatcgagagc agctgatagc ccgagtgaca tcggtggagg 4680 
ttgtcttgga caaactgcat ggacaggtgc tgagccagga gcagtacgag agggtgctgg 4740 
ctgagaacac gaggcccagc cagatgcgga agctgttcag cttgagccag tcctgggacc 4800 
jgaagtgcaa agatggactc taccaagccc tgaaggagac ccatcctcac ctcattatgg 4860 
aactctggga gaagggcagc aaaaagggac tcctgccact cagcagctga agtatcaaca 4920 
ccagcccttg acccttgagt cctggctttg gctgaccctt ctttgggtct cagtttcttt 4980 
ctctgcaaac aagttgccat ctggtttgcc ttccagcact aaagtaatgg aacttatgat 5040 
gatgccttgc tgggcattat gtgtccagcc agggatgcac agggggccca gtcaggtggg 5100 
ctacagcatc tcagggatgt cc 5122 

<210> 30 
<211> 1241 



25/31 



BNSDOCID <WG 



PCT/LS*H/(»3455 



WO 

<212> DNA 

<213> Homo sapiens 
<220> 

<^^1> misc feature 

<223> incyte ID No: 6726656CB1 



:t:^^.. .caaccaaca aat.acacag caaC.aa.tc ...actga ccccaca.- 
ttgagcctag acactagctg tgtcactaac ^ctctgtg g cCgctgtgat 180 

aaigitttga taaacgtgga aaataaccgc tgtt ga ^^^^^^^^gt ttatcCctcc 240 
gtgaggtgtg gaagaaacac gggagcagcc ^^ccu gg ^ ^^^^^ gaaacccgtc 3 00 

agct?lglaa tcacatgaag ^tgtggaCgg ^gagccacc^ ^ttcltcctg cagggctttt 360 
ccagcccaag gatgatgagt ^f^^^gacgt ^99"* g ^^tcttcctc tactctgggg 420 
cggagcaccc agaataccgg ^tgttcttat ^cagctgt ccccacgctc 430 

cictlacagg taatgtcctc cttcgga^at tatctgcacc tcttccatca 540 

ctatgtactt tttcttactc ^f^^ggcta f^acgga ^^^^^^^ gggggctgca 600 

tgcclaaggc gctggccagt ^^ggtgtcgg gccgctgctc ctcacggtca 660 

tggcccagct ctatttcctc ^^^g^gggctg 9^^^ atgatgagca 720 

cggcctatga ccggtacgca gccatctgcc ctgcgccgtc aacacggcca 780 

aggtgttc?g cagcgggctg ^'^-^--^lll ^f^g^ggccc caltgtcatt atccatttct 840 
tccacacggg gctgacgctg ^^^^tggatt ^ctg gg ^^^^^acgtc aacggtgtca 900 
-^t-T-Ta-a«- -rctcccctg ctgcttcccc ccv.y>- y r-r^aaraa.'-r atcgcgtcct 960 
^gartftfct ggcggatgct ttcta ggca -g^g-ctt c-^-^^^ cagLagcct 1020 
atggcttcat cgtctccagc ^^cctgaagg ^^^^l ttacaccgct gtcttctacg 1080 

tc?lcacctg ctcttcccac ^^caccgtgg ll^flJ caagttggct ggcctgctgt 1140 
cctacataag cccggtctct ^^^^^^^^^^ tca??talac tttgagaaac aaggaggtca ..00 
acactgtgct ^agtcctacc ^^^^^^^^^^ tcagaaatta a ^^^^ 
aagcagccct caggaagctt ttcccctucu 



<21G> 31 
<211> 1155 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature ^ 
<oo3> incyte ID No: 7472062CB1 



ItimU t.=e„=„. ;tS:il?S clllll'Hl tll-lTcT. 'jSo 

agJct.cMt cagcocccaa o;""''" „Lagataa aaggggattg tttcascac. 180 

tllfdllll %l^A%Ti "" I «»4.;» r.Sl?"co fo 

tctcatctcc acactcccat S^actttCtt f^^l ttatagaaga caaagccacc 4 50 

tactcctcag ctgtcactcc ^^^ggtttta ^=^999 | tctttgccac tgtggaaaat 540 

tcctacagtg cctgtgctgc tcagatgttc ^^^^^^^ ^ tqtgtaaccc cctacattat 600 

?acctct?gt cctcaatggc -^atgaccgc t-gc^gcag ^f^ tgtcattggt 0 

accaccacca tgacaacacg ^gtgtgcgct ^9^^ ^g tctctttccg catgcccaat 720 

:^gK?".« 

gattcagtcC tttaa 



<210> 3 2 
<211> 1260 
<212> DNA 

<213> Homo sapiens 
<220> 

26/31 



BNSDOCID <WC 0l57Oe5A2 i > 



w o 01/5''<IK5 



PCT/LSOl/03455 



<221> misc_featurG 

<223> Incyte ID No; 7472067C31 

•:';C0> 3 2 

a-:qctggcag ctgcccttgc agact^taac nccagcagca tgaatgngtc ctttgctcac 50 

c^^ccacttcg ccggagggta cctgccctct gattcccagg actggagaac catcatcccg 12C 

gct-trttgg tggctgtctg cctggtgggc ttcgtgggaa acct:g::gi:gt: gattggcatc 18C 

rtcztccaca atgcttggaa aggaaagcca tccatgatcc actcccLgac tctgaacctc 24C 

agcctggctg atctctccct cctgccgttt tctgcaccta tccgagccac ggcgtacccc 300 

aaaagtgttt gggatctagg ctggtttgtc tgcaagtcct ctgactggtt catccacaca 360 

tgcatggcag ccaagagcct gacaatcgtc gtggtggcca aagtacgctt catgtatgca 420 

agtgacccag ccaagcaagt gagtatccac aactacacca tctggccagt gccggcggcc 480 

ai'-tggactg tggctagcct: gttacccctg ccggaatggt tctttagcac caccaggcat 540 

-jatgaaggtg tggaaatgtg cctcgcggat gtaccagctg tggctgaaga gtttatgtcg 600 

atgtttggta agctctaccc actcctggca tttggccttc cattattttt tgccagcttt 660 

taitnctgga gagcttatga ccaatgtaaa aaacgaggaa ctaagactca aaatcttaga 720 

aarragatac gctcaaagca agtcacagtg atgctgctga gcattgccat catctctgct 780 

ri:ccgtggc tccccgaatg ggtagcttgg ctgtgggtat ggcatctgaa ggctgcaggc 340 

rj.igcrccac cacaaggttt catagccctg tctcaagtct tgatgtttitc catctcrrca 900 

graaaccccc tcatttttct tgtgatgtcg gaagagttca gggaaggctt gaaaggtgta 960 

Cii jaaatgga tgataaccaa aaaacctcca actgtctcag agtctcagga aacaccagct 1020 

j-.j ^ciactcag agggtc ttcc tgacaaggtt ccatctccag aat ccc cage atccatacca 108 0 

jcijaaagaga aacccagctc tccctcctct ggcaaaggga aaactgagaa ggcagagatt 1140 

c::.azcctzc ctgacgtaga gcagttttgg catgagaggg acacagtccc ttctgtacag 1200 

garaatgacc ctatcccctg ggaacatgaa gatcaagaga caggggaagg tgttaaatag 1260 

< : : 0 > 3 3 

< 2 1 1 > 9 4 5 

< : : 2 > DNA 

<^13> Homo sapiens 

< 2 2 0 > 

■' 2 2 1 > misc_f eature 

<223> Incyte ID No: 7472072CB1 

<400> 33 

ai ^jggggatg tgaatcagtc ggtggcctca gacttcattc tggtgggcct cttcagtcac 6 0 

^-raggarcac gccagcCcct cttctccctg gtggctgtca tgtttgtcat aggccttctg 120 

ggraacaccg ctcttctctt cttgatccgt gtggactccc ggctccatac acccatgtac 180 

t':ctgcnca gccagctctc cctgtttgac attggctgtc ccatggtcac catccccaag 240 

a:.,^3catcag actttctgcg gggagaaggt gccacctcct atggaggtgg tgcagctcaa 300 

aiattcttcc tcacactgat gggtgtggct gagggcgtcc tgttggtcct catgtcttat 360 

gaccgttatg ttgctgtgtg ccagcccctg cagtatcctg tacttatgag acgccaggta 420 

tg^_::tgctga tgatgggctc ctcctgggtg gtaggtgtgc tcaacgcctc catccagacc 480 

tc-atcaccc tgcattttcc ctactgtgcc tcccgtattg tggatcactt cttctgtgag 540 

gtgccagccc tactgaagct ctcctgtgca gatacctgtg cctacgagat ggcgctgtcc 600 

ao 3tcagggg tgctgatcc t aatgctccct ctttccc tea tcgccacctc ctacggccac 6 60 

gt^ttgcagg ctgttctaag catgcgctca gaggaggcca gacacaaggc tgtcaccacc 720 

tgrtcctcgc acatcacggt agtggggctc ttttatggtg ccgccgtgtt catgtacatg 780 

gtgccttgcg cctaccacag tccacagcag gataacgtgg tttccctctt ctatagcctt 840 

gtoaccccta cactcaaccc ccttatctac agtctgagga atccggaggt gtggatggct 900 

ttqgtcaaag tgcttagcag agctggactc aggcaaatgt gctga 945 

<:2 10> 3 4 
<: 2 1 1 > 7 6 5 

< 2 1 2 > DNA 

-:2 13> Homo sapiens 
<: 2 2 0 > 

<221> mis cofeature 

<223> Incyte ID No : 7472C74CB1 

<400> 34 

at;jgcctc^c ggtatgtggc agtgggaatg atcttatcac agaccgtggt gggagtcctg 60 

gggagcttct ctgttcttct oca t Cat etc tccttttact gcactgggtg caggttaagg 12 0 

tccacagatt tgattgttaa gcacctgatt gtagccaact tcttagctct ccgctgtaaa 180 

ggagtccecc agacaatggc agcttttggg gttagatatt ttctcaatgc tcttgggtgc 240 
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BNSDOCID <WC 3157C'85A2 i > 



wo <H/57(»S5 



PCT/US01/*I3455 

300 



gagaaagccc ccaagcatgt tggcttt.cc ^"^^I^^^ ^ctacacaaa catcacagtg 480 

Bs;sx pip ;f,r,r-4 r."«r.?= rf? 



li iiSi iiii iSS lits-" ™- 



tcca 

CLCCC 

acccttgtg 



<21C> 35 
<21l> 1089 
<212> DMA 

<Z13> Homo sapiens 
< 2 2 0 > 

<223> Incyte ID No: 7472077CB1 



<400> 35 ^ ^=.^tr.rT^ aactattttc ttctctgtga cgccgagggg 60 

ata-acaagg accgcatcga gtccactgga ^^^^^^^ tcatggtcac aattctgcta 120 

i~ ill ! i 

gctctctgtt tctcatgcct ^ttagctcat l^^H ttggttgcag tccgttgcaa 420 

tgtgtctcct tctcctggac gacaattctg ^ gaggtacgat gtttgtgaat 480 

;?clttattg ccactgagta tgtgactctc jtcatgacca 5 99^^^^^^ ccccttcccg 540 
atgacaccct gccagctcaa tgtggacttt |^tg gcccgtgtga gaactggaag 600 

atggccctca cattcttcgt ^^ccaaagcc ?catcatctg ggtggtgtgg 660 

caglatggaa ggctcatctt tatcactgtg agccccagtg ggacgacccg 720 

atctccatgc tcctgagagg caacccgcag ^tccagcg catcgtccct 780 

gt-gtctgca ctgctctggt caccaacgca ^g^^ttccc ^ ^ caatgcctgc 840 

qagctctgca ttctctacag atcgtgtaga c^ggagtgcc cagagacaaa 9 00 

I'Stcacag cctaccaaca cagcttccaa ^^ggagaacc agg^g ^g^cagtgat 960 

tggLggtct tactcaactc gg^^"ccta ^cacacag g ^^^^^^ gcagactgtt 1020 

-^fcSfc llllltltll cllcTcllll Tolll^o... gcccccagca agatgcagga 1080 



ggagrataa 

<210> 36 
<211> 1334 
<212> DNA 

<213> Homo sapiens 
<220> 

<^21> raisc_f eature ^^^^^^ 
<223> incyte ID No: 7472082CB1 



gcttcagaag .ggcaatagc actcca ^.tg-ac^^^ ^^fcaac^ta % 

g^atggcagc ccaggaaggc -caagaaca -gcaaactg l^J^l^^^^ tgacaagatc 180 
^attccaaga aagagtgcat gtacattaag tggctgggct ctcagaacag 240 

gtggaccctg gaaaccattc ^tcagtgact ^-^^^^^^^^ tc?gtgtggt cacagtggtg 300 
ccagagctcc agctgcgcct ^"^^^^^^^ ctcaatlctc acctgcacac acctatgtac 360 

-ssss^ tis ?t^?fai=:c 

?r4?^ro^? ^^IcaS^^S r,r.r„aS l t«|a.«ac a,»=a»cc e 

aotttt»?gt cgag.ctctc tccgtgcaag ;«"t9^9| :"atgaatt actggLtctg 780 
"tttcccte tctcggggct cccccgctcc agcacctaca ^ ctacatcttt 840 

,;c"tgMtg catttaac.t cetgatgccn S""""» g^"cLagc cttc.gc.ct 900 
J;=at?gica gcatcctccg ja;";"-? ?lg«agcatt c.tgt.ccCg 960 

crg:S«??c ;I^^S;lag ;;g..ag?gt =gtctgtg« «a..=ra« 
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attgtgcc-a tgc tgaac cc cctgatctac agcctgagga ataaagatgr caaacttgcc 1030 

gtgaagaaaa ttctgcatca gacagcatgt taaCgaatag aatcaatgtt atgttgtrac 114 0 

at zaagatag gtctttggtt tgattagaca ccnaact: tat tggact tatt gttgagatr:!: 1200 

atgaaaacrt agtgatgctc tttnatgtaa cacctctcca aaatatrcct ccggtctgct 1260 

tccatcgaac ttatattcca atgagcatan gtaaagaaat acaaagaata aaatcaaaag 1320 

actcccgagg ttta 1334 



<21G> 37 

<211> 960 

< 2 1 2 > DNA 

<213> Homo 



saDi ens 



<220> 

<221> misc_feature 
<223> Incyce ID No: 



7472128CB1 



<400> 37 
atgacacc tg 
ttgcaggga t 
at c tatgcca 
catc tccaaa 
tc ttctgtgg 
tt tgagggct 
c tgc tggcc t 
tc tatcatga 
gccaac tccg 
ctaacacac t 
acagcaagag 
at tc tcacc t 
agggagaagg 
ac tgtggttt 
gtgtccgtc t 
aacaaggagg 



gagaac tagc 
tc tccaat ta 
taacagtggt 
gcccaatgta 
tcacacc taa 
gtgtggtcca 
ccatggccta 
ccagggggac 
c tatccagac 
actactgtga 
tggtcctcta 
cctactgct t 
acctc tccac 
tcacctatg t 
tc tacaccat 
tgaagggcgc 



cc t tgccagt 
tccagacc tc 
gggcaac t tg 
1 1 tct tec tc 
gc tcttggtc 
gc tcgcc t tc 
tgaccgc t tc 
ctgtctccag 
tggaaatgtc 
cataccaccc 
tgtcttttct 
ggtcttggtg 
ttgtgcctcc 
tcagccccat 
cataat tccc 
tc tgcagagg 



ggcaaccaca 
caggagc t tc 
ggaatga tgg 
aatgtcc tc t 
aact tec tgg 
tt tgtag tgc 
ctagccatc t 
ctggtagctg 
tt tgccc tgc 
ct tctccacc 
gc tctggtca 
gccattggga 
cact ttctgg 
ggatctac ta 
atgc tcaatc 
aagc t tcagg 



ccccagtcac 
tct tcggagc 
cac tcatctt 
cgtt tc t tga 
tc tc tgacaa 
atgtgacagc 
gtcaacccc t 
tgtcctatgc 
ctt tctgtgg 
tggcttgtgc 
ccc ttc tgcc 
ggatgcgctc 
ccattgccat 
acaataccaa 
cct tcatcta 
tcaacatc t t 



caagt tcatc 
catcc tgc t c 
cacagactcc 
tatttgttac 
gtccatc tct 
tgagagcttc 
ccat tatggt 
atttggtgga 
gcccaaccag 
caacacagcc 
tgc tgcagtc 
agtagcaggg 
t ttctatggc 
tggccaagta 
tagcctccgc 
tcccggc tga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



<210> 38 
<211> 939 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



7472134CB1 



<400> 38 
atggacacag 
cagggtgtcc 
ggaaacc tgc 
cact ttgtca 
atgc tggcaa 
atctatttc t 
gataggtat t 
tgtgcagaga 
tccttgatt t 



gt tataaat t 
atcatc tgca 
tgtgcc tccc 
cage tgaaga 
c tcacaccc t 
g tgaggaggc 



ggaactggag 
agat ttatct 
tgatattcc t 
gcat tc tctc 
act tgc tcag 
t tcac tccc t 
tagccatctg 
t tgccat tgg 

tgc tgagt t t 
cctgcaagat 
cagtgctcag 
ac t tcactgt 
agagc tactc 
tcctcaaccc 
age taaagag 



ccaggtagca 
ct tcctc t tg 
ggtggtc tgc 
ct tc tcagag 
tgagaaaaag 
tggagcgac t 
ccggeecc tc 
e tgt tggt tg 
at tctgtggc 
gge t tgcac t 
cc tagccaec 
aat tccc tea 
ggttc tcatc 
ac tggactat 
c ttcatc tac 
aat tgggata 



gaattcatca 
t tgcttc tea 
c tggactcec 
ct tggetata 
accat ttcat 
gagtge tate 
cac tacccaa 
ggaggcttgg 
cecaatcgca 
gatacgtc ta 
t tec tgc tga 
gc tgecggca 
tte tatggga 
gaccaggeec 
age t tgcgca 
t tggeatga 



tcttgggc t t 
t ttace teat 
ggctteacac 
cagctgceae 
tc tetgggtg 
tee tgacage 
ecctcatgae 
c tgggeeagt 
t teageaegt 
taaatgtcc t 
tcctctgc tc 
agaggaaggc 
gcatce t t te 
tggcagtggt 
aeaaggagat 



<210> 39 
<211> 968 
<212> DNA 

<213> Homo sapiens 



cceccatcte 
gaetgtgt tg 
aeeeatgtac 
catcce taag 
tc tee tgcag 
tatggcc tac 
eceaacac t t 
agttgaaat t 
c t tttgtgac 
agtaga tt tt 
c tatgtgcag 
catc teeacg 
catgtatgtg 
c tactcagtg 
caaggaggc t 



60 

120 

ISO 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

939 



:20> 
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VV O 01/5708?; 

<223> Incyte ID No: 7472136CB1 



lllZAl.. 3.c.t„a.t agc,,t„,a «,c«.,.c „cc«aa.c t«.c=.,« p 
ggSgLc.gt gc.cacggcc t.cctggtgc wg«=cct 9g__^ cacagg.acc 130 

o'aaaatggc aggcaaeagc atggtgacet «f"S- ^"J cetcttcctc cccagcatgg 240 
iftdlgclc ctatatcccc aacctggcgg Xactgac.g gtcc.cgagc 300 

!S"Sfg;5 Sgf^?" r.rg ? ■ 
i rj??'agi «lggif«? li i big! ^^^^ 

?^gl"S= =rc"r="? i --r,'-^ llHtlllll acafgS 

rc-t-ttcgt ctgggtgcgg aggagctcc^ ^^^^^=,rcta ttccctgcct ctgagcatct 72U 

?«"cg"<^ EtccCMtoc ?«"?tgog gSctgtgctc caaeaggcgo 900 

?i;r=gSgl IScSS^ liilT.lT. Iir^cc?!? lg«g,ca=c aa.gaga„g |.0 



gqgc ttga 

<210> 40 

<211> 1000 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<-^-^3> incyte ID No: 747zl42CBl 



itlllAU gggaatagt. =t,g„ga.. ....|atg.. a«|a|||- 1^0 

ag?=g^l"r= ?^S?S?=a gcl^.g'S" |1. Ijgjca .=,c.c«gg gc.ac.gc.c 

r?t?lttctg atcactatcc tagattcacg -^^^taaaacc ^^^9 tgctggtgaa 3 00 

aaatctctc? ttcacggata tttgttacac atctacctct .tt^^^^^^^ tgta.ctgtc 360 

cttgctgtca tcccagaaaa =^^"atctt atggcatatg accgttatgt 420 

-"^r.?a? Ifccrgcngl T^Tclll ^catlltgaac aggtgcgtct gtgcacggat 480 
llTcTalTc ^ccfg'ggt^a cgggttgcct gaccgctc.g cjggaaac^ ^.cg^tgct 600 

Sf.agr. ^.rca?afg?t |c |c 
r.rgaSfcc l^c^cfg^lg ^ caaggcc.. .c.acc.g.. ...ccca. 

giclg^ggtg attttgtatt atggggctgc cc^ctct^^g ^^^^g^,,, ^^""^f,^" 960 
aaatgcacaa -aatagaca aaatcatctc S^J^^^^^^ gatgctatga agaaat.gct 960 
gaaccccata atctacagtc « acatctctga 



|;rfaS"a a^a'etgS^c iarcar.Jg. :S..c.c„. 

<210> 41 
<211> 1008 
<212> DNA 
<213> Homo sapiens 

<220> 

<^03> incyte ID No: 7472171CB1 



life ss sss iili SI ill | 

m ill S3 sES 53SS SSS h: 



30/3 1 



BNSDOCID <WO . 0^57095A2 



aggtaccacg 


tgc tcargag 


ccaccggc t c 




ctccgcctcc 


tcaagccggt 


gacrgagatg 






t cag t, cac t w 


cacccgcaag 


a t 


aacacg -egg 


tcagcgaaga 


Gttcctgctg 


gc 


ctggcatt =a 


tctgcctgtc 


ctacttgctc 


at 


gccgccaggt 


gc tgcaaagc 


ct tc tccacc 


tg 


ttctacggca 


ccatcacct t 


catgtac t tg 


aa 


gatgaggt-t 


tcacagtcc t 


ctatgcca tg 


gt 


agcc tgagga 


acaaggaggt 


gaaggaggcc 


gc 


t ::cagtgagg 


gagggcgggg 


etc tgtacag 


ac 


<210> 42 








<211> 972 








<212> DNA 








<213> Homo 


sapiens 







PCT/LSOl/034 

cgtgctgc tgatgggagc tgcctgggtc 430 

catctcca tgaggc tgcc c ttctgtggc S4 0 

cc tggcag tgczgaagct ggcatgcggc 6 0 0 

gggctcca tcctyctgct gcctgtaccc 660 

cctggcca cca^^cc tgag ggtgccctcg 720 

cttggcac acctggctgc agtgctgctt 730 

gcccaaga g taaggaagc ccacatctct S40 

cacgacca tgctgaaccc caccatctac 9 00 

caggaagg tgtggggcag gagtcgggcc 960 

gcaggtct caggttag 1008 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7472172CB1 



<400> 42 

ggaaaccct g cccaccatat 
accc taattg cot tec cage 
gtaac ttaca ca'ttaacagc 
catcgcctgc aaac tccaat 
tacaccactg tcattacccc 
tcttt tgctg gttgcatgat 
atcc tc ttgg cggtgatgtc 
acggtcatca tgaacagcag 
ttcc tgtctg tgttgtttcc 
attaatcatt tcttctgtga 
c teat tgaga agataaact t 
ac tac tgggt cctacgtgta 
cgteagaaag etttttctac 
ageaacatc t ttgtgtatgt 
gccgctgtcc teatcacagt 
aatgagaagg tacaggaagt 
aggaaaactt ga 



agtagttgtc atgggaaaet 
tc tee tggag at tcgaatat 
aacaggaaac atcaceatca 
gtacttc ttc etcagtaat t 
aaagttgttg gcc tgcc tec 
cc aaacatat ttctacttct 
ctttgaccgc tacatggcta 
ggectgcct t ctgctggttc 
aaccattgta gtgacaaggc 
cattgcccct cttcttcagg 
tctcctctct gccettgtca 
cataatttc t accatcc tgc 
ctgtgcttc t cacatcactg 
gagacccaat cagaac tec t 
ggtgacccc t etcc tgaacc 
gttgagagag acagtgaaca 



ggageaetgt gactgaaatc 6 0 
ctctcttcgt ggttcttgtg 12 0 
tetccctgat atggattgat 180 
tgtcctttct ggatatctta 24 0 
taggagaaga gaaaaccata 3 00 
ttctggggac ggtggagttt 3 60 
tctgcgaccc ac tgcaetac 4 2 0 
tggga tgctg ggtgggagcc 480 
tacct tactg taggaaagaa 540 
tggcctgtat aaatac teae 600 
tcctgagctc cctggcattc 660 
gtatcccctc cacccaggge 720 
ttgtetccat tgceeacggg 7 80 
cactggatta tgacaaggtg 84 0 
cttttatcta cagettgagg 900 
gaatcatgac cttgatacaa 960 

972 
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